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NOVEL NUCLEIC ACIDS AND POLYPEPTIDES 

1. TECHNICAL FIELD 

4 The present invention provides novel polynucleotides and proteins encoded by such 

polynucleotides, along with uses for these polynucleotides and proteins, for example in 
therapeutic, diagnostic and research methods. 

8 2. BACKGROUND 

Technology aimed at the discovery of protein factors (including e.g., cytokines, such as 
lymphokines, interferons, CSFs, chemokines, and interleukins) has matured rapidly over the past 
decade. The now routine hybridization cloning and expression cloning techniques clone novel 

12 polynucleotides "directly" in the sense that they rely on information directly related to the 
discovered protein (z.e., partial DNA/amino acid sequence of the protein in the case of 
hybridization cloning; activity of the protein in the case of expression cloning). More recent 
"indirect" cloning techniques such as signal sequence cloning, which isolates DNA sequences 

16 based on the presence of a now well-recognized secretory leader sequence motif, as well as 

various PCR-based or low stringency hybridization-based cloning techniques, have advanced the 
state of the art by making available large numbers of DNA/amino acid sequences for proteins 
that axe known to have biological activity, for example, by virtue of their secreted nature in the 

20 case of leader sequence cloning, by virtue of their cell or tissue source in the case of PCR-based 
techniques, or by virtue of structural similarity to other genes of known biological activity. 

Identified polynucleotide and polypeptide sequences have numerous applications in, for 
example, diagnostics, forensics, gene mapping; identification of mutations responsible for 

24 genetic disorders or other traits, to assess biodiversity, and to produce many other types of data 
and products dependent on DNA and amino acid sequences. 

3. SUMMARY OF THE INVENTION 

28 The compositions of the present invention include novel isolated polypeptides, novel 

isolated polynucleotides encoding such polypeptides, including recombinant DNA molecules, 
cloned genes or degenerate variants thereof, especially naturally occurring variants such as allelic 
variants, antisense polynucleotide molecules, and antibodies that specifically recognize one or more 

3 2 epitopes present on such polypeptides, as well as hybridomas producing such antibodies. 

The compositions of the present invention additionally include vectors, including expression 
vectors, containing the polynucleotides of the invention, cells genetically engineered to contain such 
polynucleotides and cells genetically engineered to express such polynucleotides. 
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The present invention relates to a collection or library of at least one novel nucleic acid 
sequence assembled from expressed sequence tags (ESTs) isolated mainly by sequencing by 
hybridization (SBH), and in some cases, sequences obtained from one or more public databases. 
4 The invention relates also to the proteins encoded by such polynucleotides, along with therapeutic, 
diagnostic and research utilities for these polynucleotides and proteins. These nucleic acid 
sequences are designated as SEQ ID NOS: 1 -1 3901 . The polypeptides sequences are designated 
SEQ ID NOS: 1 3902-27802. The nucleic acids and polypeptides are provided in the Sequence 
8 Listing. In the nucleic acids provided in the Sequence Listing, A is adenosine; C is cytosine; G is 
guanine; T is thymine; and N is any of the four bases. In the amino acids provided in the Sequence 
Listing, * corresponds to the stop codon. 

The nucleic acid sequences of the present invention also include, nucleic acid sequences that 
1 2 hybridize to the complement of SEQ ID NO: 1 - 1 3 90 1 under stringent hybridization conditions; 
nucleic acid sequences which are allelic variants or species homologues of any of the nucleic acid 
sequences recited above, or nucleic acid sequences that encode a peptide comprising a specific 
domain or truncation of the peptides encoded by SEQ ID NOS: 1 -1 3901 . A polynucleotide 
1 6 comprising a nucleotide sequence having at least 90% identity to an identifying sequence of SEQ 
ID NO : 1 - 1 3 90 1 or a degenerate variant or fragment thereof. The identifying sequence can be 1 00 
base pairs in length. 

The nucleic acid sequences of the present invention also include the sequence information 

20 from the nucleic acid sequences of SEQ ID NO: 1-13901. The sequence information can be a 
segment of any one of SEQ ID NO: 1 - 1 3 90 1 that uniquely identifies or represents the sequence 
information ofSEQ ID NOS: 1-13901. 

A collection as used in this application can be a collection of only one polynucleotide. The 

24 collection of sequence information or identifying information of each sequence can be provided on 
a nucleic acid array. In one embodiment, segments of sequence information is provided on a 
nucleic acid array to detect the polynucleotide that contains the segment. The array can be designed 
to detect full-match or mismatch to the polynucleotide that contains the segment. The collection 

28 can also be provided in a computer-readable format. 

This invention also includes the reverse or direct complement of any of the nucleic acid 
sequences recited above; cloning or expression vectors containing the nucleic acid sequences; and 
host cells or organisms transformed with these expression vectors. Nucleic acid sequences (or their 

32 reverse or direct complements) according to the invention have numerous applications in a variety 
of techniques known to those skilled in the art of molecular biology, such as use as hybridization 
probes, use as primers for PCR, use in an array, use in computer-readablemedia, use in sequencing 
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full -length genes, use for chromosome and gene mapping, use in the recombinant production of 
protein, and use in the generation of anti -sense DNA or RNA, their chemical analogs and the like. 
In a preferred embodiment, the nucleic acid sequences of SEQ ID NO: 1 -1 3901 or novel 

4 segments or parts of the nucleic acids of the invention are used as primers in expression assays that 
are well known in the art. In a particularly preferred embodiment, the nucleic acid sequences of 
SE Q ID NO : 1 - 1 3 90 1 or novel segments or parts of the nucleic acids provided herein are used in 
diagnostics for identifying expressed genes or, as well known in the art and exemplified by Vollrath 

8 et al., Science 258:52-59 (1 992), as expressed sequence tags for physical mapping of the human 
genome. 

The isolated polynucleotides of the invention include, but are not limited to, a 
polynucleotide comprising any one of the nucleotide sequences set forth in SEQ ID NO: 1-13901; a 

1 2 polynucleotide comprising any of the full length protein coding sequences of SEQ ID NOS : 1 - 
13901; and a polynucleotide comprising any of the nucleotide sequences of the mature protein 
coding sequences of SEQ ID NOS: 1-13901 . The polynucleotides of the present invention also 
include, but are not limited to, a polynucleotide that hybridizes under stringent hybridization 

1 6 conditions to (a) the complement of any one of the nucleotide sequences set forth in SEQ ID NO : 1 - 
1 3 90 1 ; (b) a nucleotide sequence encoding any one of the amino acid sequences set forth in the 
Sequence Listing (e.g. 9 SEQ ID NOS: 1 3902-27802); (c) a polynucleotide which is an allelic variant 
of any polynucleotides recited above; (d) a polynucleotide which encodes a species homolog (e.g. 

20 orthologs) of any of the proteins recited above; or (e) a polynucleotide that encodes a polypeptide 
comprising a specific domain or truncation of any of the polypeptides comprising an amino acid 
sequence set forth in the Sequence Listing. 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 

24 comprising any of the amino acid sequences set forth in the Sequence Listing; or the corresponding 
full length or mature protein. Polypeptides of the invention also include polypeptides with 
biological activity that are encoded by (a) any of the polynucleotides having a nucleotide sequence 
set forth in SEQ ID NO: 1-13901 ; or (b) polynucleotides that hybridize to the complement of the 

28 polynucleotides of (a) under stringent hybridization conditions. Biologically or immunologically 
active variants of any of the polypeptide sequences in the Sequence Listing, and "substantial 
equivalents" thereof (e.g. , with at least about 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% 
amino acid sequence identity) that preferably retain biological activity are also contemplated. The 

32 polypeptides of the invention may be wholly or partially chemically synthesized but are preferably 
produced by recombinant means using the genetically engineered cells (e.g. host cells) of the 
invention. 
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The invention also provides compositions comprising a polypeptide of the invention. 
Polypeptide compositions of the invention may further comprise an acceptable carrier, such as a 
hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 
4 The invention also provides host cells transformed or transfected with a polynucleotide of 

the invention. 

The invention also relates to methods for producing a polypeptide of the invention 
comprising growing a culture of the host cells of the invention in a suitable culture medium 
8 under conditions permitting expression of the desired polypeptide, and purifying the polypeptide 
from the culture or from the host cells. Preferred embodiments include those in which the 
protein produced by such process is a mature form of the protein. 

Polynucleotides according to the invention have numerous applications in a variety of 
12 techniques known to those skilled in the art of molecular biology. These techniques include use 
as hybridization probes, use as oligomers, or primers, for PGR, use for chromosome and gene 
mapping, use in the recombinant production of protein, and use in generation of anti-sense DNA 
or RNA, their chemical analogs and the like. For example, when the expression of an mRNA is 
16 largely restricted to a particular cell or tissue type, polynucleotides of the invention can be used 
as hybridization probes to detect the presence of the particular cell or tissue mRNA in a sample 
using, e.g., in situ hybridization. 

In other exemplary embodiments, the polynucleotides are used in diagnostics as 
20 expressed sequence tags for identifying expressed genes or, as well known in the art and 
exemplified by Vollrath et aL, Science 258:52-59 (1992), as expressed sequence tags for 
physical mapping of the human genome. 

The polypeptides according to the invention can be used in a variety of conventional 
24 procedures and methods that are currently applied to other proteins. For example, a polypeptide 
of the invention can be used to generate an antibody that specifically binds the polypeptide. Such 
antibodies, particularly monoclonal antibodies, are useful for detecting or quantitating the 
polypeptide in tissue. The polypeptides of the invention can also be used as molecular weight 
28 markers, and as a food supplement. 

Methods are also provided for preventing, treating, or ameliorating a medical condition 
which comprises the step of administering to a mammalian subject a therapeutically effective 
amount of a composition comprising a polypeptide of the present invention and a 
32 pharmaceutically acceptable carrier. 

In particular, the polypeptides and polynucleotides of the invention can be utilized, for 
example, in methods for the prevention and/or treatment of disorders involving aberrant protein 
expression or biological activity. 
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The present invention further relates to methods for detecting the presence of the 
polynucleotides or polypeptides of the invention in a sample. Such methods can, for example, be 
utilized as part of prognostic and diagnostic evaluation of disorders as recited herein and for the 
4 identification of subjects exhibiting a predisposition to such conditions. The invention provides 
a method for detecting the polynucleotides of the invention in a sample, comprising contacting 
the sample with a compound that binds to and forms a complex with the polynucleotide of 
interest for a period sufficient to form the complex and under conditions sufficient to form a 
8 complex and detecting the complex such that if a complex is detected, the polynucleotide of 
interest is detected. The invention also provides a method for detecting the polypeptides of the 
invention in a sample comprising contacting the sample with a compound that binds to and forms 
a complex with the polypeptide under conditions and for a period sufficient to form the complex 
12 and detecting the formation of the complex such that if a complex is formed, the polypeptide is 
detected. 

The invention also provides kits comprising polynucleotide probes and/or monoclonal 
antibodies, and optionally quantitative standards, for carrying out methods of the invention. 
16 Furthermore, the invention provides methods for evaluating the efficacy of drugs, and 

monitoring the progress of patients, involved in clinical trials for the treatment of disorders as 
recited above. 

The invention also provides methods for the identification of compounds that modulate 

20 (i.e., increase or decrease) the expression or activity of the polynucleotides and/or polypeptides 
of the invention. Such methods can be utilized, for example, for the identification of compounds 
that can ameliorate symptoms of disorders as recited herein. Such methods can include, but are 
not limited to, assays for identifying compounds and other substances that interact with (e.g., 

24 bind to) the polypeptides of the invention. The invention provides a method for identifying a 
compound that binds to the polypeptides of the invention comprising contacting the compound 
with a polypeptide of the invention in a cell for a time sufficient to form a 
polypeptide/compound complex, wherein the complex drives expression of a reporter gene 

28 sequence in the cell; and detecting the complex by detecting the reporter gene sequence 

expression such that if expression of the reporter gene is detected the compound the binds to a 
polypeptide of the invention is identified. 

The methods of the invention also provides methods for treatment which involve the 

32 administration of the polynucleotides or polypeptides of the invention to individuals exhibiting 
symptoms or tendencies. In addition, the invention encompasses methods for treating diseases or 
disorders as recited herein comprising administering compounds and other substances that 
modulate the overall activity of the target gene products. Compounds and other substances can 
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effect such modulation either on the level of target gene/protein expression or target protein 
activity. 

The polypeptides of the present invention and the polynucleotides encoding them are also 
4 useful for the same functions known to one of skill in the art as the polypeptides and 

polynucleotides to which they have homology (set forth in Table 2). If no homology is set forth 
for a sequence, then the polypeptides and polynucleotides of the present invention are useful for 
a variety of applications, as described herein, including use in arrays for detection. 

8 

4. DETAILED DESCRIPTION OF THE INVENTION 

12 4.1 DEFINITIONS 

It must be noted that as used herein and in the appended claims, the singular forms "a", 
"an" and "the" include plural references unless the context clearly dictates otherwise. 

The term "active" refers to those forms of the polypeptide which retain the biologic 

16 and/or immunologic activities of any naturally occurring polypeptide. According to the 

invention, the terms "biologically active" or "biological activity" refer to a protein or peptide 
having structural, regulatory or biochemical functions of a naturally occurring molecule. 
Likewise "immunologically active" or "immunological activity" refers to the capability of the 

20 natural, recombinant or synthetic polypeptide to induce a specific immune response in 
appropriate animals or cells and to bind with specific antibodies. 

The term "activated cells" as used in this application are those cells which are engaged in 
extracellular or intracellular membrane trafficking, including the export of secretory or 

24 enzymatic molecules as part of a normal or disease process. 

The terms "complementary" or "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence 5'-AGT-3' binds to the 
complementary sequence 3'-TCA-5\ Complementarity between two single-stranded molecules 

28 may be "partial" such that only some of the nucleic acids bind or it may be "complete" such that 
total complementarity exists between the single stranded molecules. The degree of 
complementarity between the nucleic acid strands has significant effects on the efficiency and 
strength of the hybridization between the nucleic acid strands. 

32 The term "embryonic stem cells (ES)" refers to a cell that can give rise to many 

differentiated cell types in an embryo or an adult, including the germ cells. The term "germ line 
stem cells (GSCs)" refers to stem cells derived from primordial stem cells that provide a steady 
and continuous source of germ cells for the production of gametes. The term "primordial germ 
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cells (PGCs)" refers to a small population of cells set aside from other cell lineages particularly 
from the yolk sac, mesenteries, or gonadal ridges during embryogenesis that have the potential to 
differentiate into germ cells and other cells. PGCs are the source from which GSCs and ES cells 
4 are derived The PGCs, the GSCs and the ES cells are capable of self-renewal. Thus these cells 
not only populate the germ line and give rise to a plurality of terminally differentiated cells that 
comprise the adult specialized organs, but are able to regenerate themselves. 

The term "expression modulating fragment," EMF, means a series of nucleotides which 
8 modulates the expression of an operably linked ORF or another EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked 
sequence" when the expression of the sequence is altered by the presence of the EMF. EMFs 
include, but are not limited to, promoters, and promoter modulating sequences (inducible 

12 elements). One class of EMFs are nucleic acid fragments which induce the expression of an 
operably linked ORF in response to a specific regulatory factor or physiological event. 

The terms "nucleotide sequence" or "nucleic acid" or "polynucleotide" or 
"oligonculeotide" are used interchangeably and refer to a heteropolymer of nucleotides or the 

16 sequence of these nucleotides. These phrases also refer to DNA or RNA of genomic or synthetic 
origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA) or to any DNA-like or RNA-like material. In the 
sequences herein A is adenine, C is cytosine, T is thymine, G is guanine and N is A, C, G or T 

20 (U). It is contemplated that where the polynucleotide is RNA, the T (thymine) in the sequences 
provided herein is substituted with U (uracil). Generally, nucleic acid segments provided by this 
invention may be assembled from fragments of the genome and short oligonucleotide linkers, or 
from a series of oligonucleotides, or from individual nucleotides, to provide a synthetic nucleic 

24 acid which is capable of being expressed in a recombinant transcriptional unit comprising 
regulatory elements derived from a microbial or viral operon, or a eukaryotic gene. 

The terms "oligonucleotide fragment" or a "polynucleotide fragment", "portion," or 
"segment" or "probe" or "primer" are used interchangeably and refer to a sequence of nucleotide 

28 residues which are at least about 5 nucleotides, more preferably at least about 7 nucleotides, 
more preferably at least about 9 nucleotides, more preferably at least about 1 1 nucleotides and 
most preferably at least about 17 nucleotides. The fragment is preferably less than about 500 
nucleotides, preferably less than about 200 nucleotides, more preferably less than about 100 

32 nucleotides, more preferably less than about 50 nucleotides and most preferably less than 30 
nucleotides. Preferably the probe is from about 6 nucleotides to about 200 nucleotides, 
preferably from about 15 to about 50 nucleotides, more preferably from about 17 to 30 
nucleotides and most preferably from about 20 to 25 nucleotides. Preferably the fragments can 
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be used in polymerase chain reaction (PGR), various hybridization procedures or microarray 
procedures to identify or amplify identical or related parts of mRNA or DNA molecules. A 
fragment or segment may uniquely identify each polynucleotide sequence of the present 
4 invention. Preferably the fragment comprises a sequence substantially similar to any one of SEQ 
IDNOs: 1-13901 . 

Probes may, for example, be used to determine whether specific mRNA molecules are 
present in a cell or tissue or to isolate similar nucleic acid sequences from chromosomal DNA as 
8 described by Walsh et al. (Walsh, P.S. et al., 1 992, PGR Methods Appl 1 :241-250). They may 
be labeled by nick translation, Klenow fill-in reaction, PCR, or other methods well known in the 
art. Probes of the present invention, their preparation and/or labeling are elaborated in 
Sambrook, J. et aL, 1989, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
12 Laboratory, NY; or Ausubel, F.M. et al., 1989, Current Protocols in Molecular Biology, John 
Wiley & Sons, New York NY, both of which are incorporated herein by reference in their 
entirety. 

The nucleic acid sequences of the present invention also include the sequence 

16 information from the nucleic acid sequences of SEQ ID NOS: 1-13901 . The sequence 

information can be a segment of any one of SEQ ID NO: 1-13901 that uniquely identifies or 
represents the sequence information of that sequence of SEQ ID NOS: 1-13901 . One such 
segment can be a twenty-mer nucleic acid sequence because the probability that a twenty-mer is 

20 fully matched in the human genome is 1 in 300. In the human genome, there are three billion 
base pairs in one set of chromosomes. Because 4 20 possible twenty-mers exist, there are 300 
times more twenty-mers than there are base pairs in a set of human chromosomes. Using the 
same analysis, the probability for a seventeen-mer to be fully matched in the human genome is 

24 approximately 1 in 5. When these segments are used in arrays for expression studies, fifteen- 
mer segments can be used. The probability that the fifteen-mer is fully matched in the expressed 
sequences is also approximately one in five because expressed sequences comprise less than 
approximately 5% of the entire genome sequence. 

28 Similarly, when using sequence information for detecting a single mismatch, a segment can 

be a twenty-five mer. The probability that the twenty-five mer would appear in a human genome 
with a single mismatch is calculated by multiplying the probability for a full match (1h-4 25 ) times the 
increased probability for mismatch at each nucleotide position (3 x 25). The probability that an 

32 eighteen mer with a single mismatch can be detected in an array for expression studies is 

approximately one in five. The probability that a twenty-mer with a single mismatch can be 
detected in a human genome is approximately one in five. 
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The term "open reading frame," ORP ? means a series of nucleotide triplets coding for 
amino acids without any termination codons and is a sequence translatable into protein. 

The terms "operably linked" or "operably associated" refer to functionally related nucleic 
4 acid sequences. For example, a promoter is operably associated or operably linked with a coding 
sequence if the promoter controls the transcription of the coding sequence. While operably 
linked nucleic acid sequences can be contiguous and in the same reading frame, certain genetic 
elements e.g. repressor genes are not contiguously linked to the coding sequence but still control 
8 transcription/translation of the coding sequence. 

The term "pluripotent" refers to the capability of a cell to differentiate into a number of 
differentiated cell types that are present in an adult organism. A pluripotent cell is restricted in its 
differentiation capability in comparison to a totipotent cell. 
12 The terms "polypeptide" or "peptide" or "amino acid sequence" refer to an oligopeptide, 

peptide, polypeptide or protein sequence or fragment thereof and to naturally occurring or 
synthetic molecules. A polypeptide "fragment/ 1 "portion," or "segment" is a stretch of amino 
acid residues of at least about 5 amino acids, preferably at least about 7 amino acids, more 
1 6 preferably at least about 9 amino acids and most preferably at least about 1 7 or more amino 
acids. The peptide preferably is not greater than about 200 amino acids, more preferably less 
than 150 amino acids and most preferably less than 100 amino acids. Preferably the peptide is 
from about 5 to about 200 amino acids. To be active, any polypeptide must have sufficient 
20 length to display biological and/or immunological activity. 

The term "naturally occurring polypeptide" refers to polypeptides produced by cells that 
have not been genetically engineered and specifically contemplates various polypeptides arising 
from post-translational modifications of the polypeptide including;, but not limited to, 
24 acetylation, carboxylation, glycosylation, phosphorylation, lipidation and acylation. 

The term "translated protein coding portion" means a sequence which encodes for the full 
length protein which may include any leader sequence or any processing sequence. 

The term "mature protein coding sequence" means a sequence which encodes a peptide 
28 or protein without a signal or leader sequence. The "mature protein portion" means that portion 
of the protein which does not include a signal or leader sequence. The peptide may have been 
produced by processing in the cell which removes any leader/signal sequence. The mature 
protein portion may or may not include the initial methionine residue. The methionine residue 
32 may be removed from the protein during processing in the cell. The peptide may be produced 
synthetically or the protein may have been produced using a polynucleotide only encoding for 
the mature protein coding sequence. 
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The term "derivative" refers to polypeptides chemically modified by such techniques as 
ubiquitination, labeling {e.g., with radionuclides or various enzymes), covalent polymer 
attachment such as pegylation (derivatization with polyethylene glycol) and insertion or 
4 substitution by chemical synthesis of amino acids such as ornithine, which do not normally occur 
in human proteins. 

The term 'Variant" (or "analog") refers to any polypeptide differing from naturally 
occurring polypeptides by amino acid insertions, deletions, and substitutions, created using, e g., 
8 recombinant DNA techniques. Guidance in determining which amino acid residues may be 
replaced, added or deleted without abolishing activities of interest, may be found by comparing 
the sequence of the particular polypeptide with that of homologous peptides and minimizing the 
number of amino acid sequence changes made in regions of high homology (conserved regions) 

12 or by replacing amino acids with consensus sequence. 

Alternatively, recombinant variants encoding these same or similar polypeptides may be 
synthesized or selected by making use of the "redundancy" in the genetic code. Various codon 
substitutions, such as the silent changes which produce various restriction sites, may be 

1 6 introduced to optimize cloning into a plasmid or viral vector or expression in a particular 

prokaryotic or eukaryotic system. Mutations in the polynucleotide sequence may be reflected in 
the polypeptide or domains of other peptides added to the polypeptide to modify the properties of 
any part of the polypeptide, to change characteristics such as ligand-binding affinities, interchain 

20 affinities, or degradation/turnover rate. 

Preferably, amino acid "substitutions" are the result of replacing one amino acid with 
another amino acid having similar structural and/or chemical properties, z.e., conservative amino 
acid replacements. "Conservative" amino acid substitutions may be made on the basis of 

24 similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic 
nature of the residues involved. For example, nonpolar (hydrophobic) amino acids include 
alanine, leucine, isoleucine, valine, proline, phenylalanine, tryptophan, and methionine; polar 
neutral amino acids include glycine, serine, threonine, cysteine, tyrosine, asparagine, and 

28 glutamine; positively charged (basic) amino acids include arginine, lysine, and histidine; and 

negatively charged (acidic) amino acids include aspartic acid and glutamic acid. "Insertions" or 
"deletions" are preferably in the range of about 1 to 20 amino acids, more preferably 1 to 10 
amino acids. The variation allowed may be experimentally determined by systematically making 

32 insertions, deletions, or substitutions of amino acids in a polypeptide molecule using 

recombinant DNA techniques and assaying the resulting recombinant variants for activity. 

Alternatively, where alteration of function is desired, insertions, deletions or 
non-conservative alterations can be engineered to produce altered polypeptides. Such alterations 
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can, for example, alter one or more of the biological functions or biochemical characteristics of 
the polypeptides of the invention. For example, such alterations may change polypeptide 
characteristics such as ligand-binding affinities, interchain affinities, or degradation/turnover 
4 rate. Further, such alterations can be selected so as to generate polypeptides that are better suited 
for expression, scale up and the like in the host cells chosen for expression. For example, 
cysteine residues can be deleted or substituted with another amino acid residue in order to 
eliminate disulfide bridges. 
8 The terms "purified" or "substantially purified" as used herein denotes that the indicated 

nucleic acid or polypeptide is present in the substantial absence of other biological 
macromolecules, e.g., polynucleotides, proteins, and the like. In one embodiment, the 
polynucleotide or polypeptide is purified such that it constitutes at least 95% by weight, more 

12 preferably at least 99% by weight, of the indicated biological macromolecules present (but water, 
buffers, and other small molecules, especially molecules having a molecular weight of less than 
1000 daltons, can be present). 

The term "isolated" as used herein refers to a nucleic acid or polypeptide separated from 

16 at least one other component (e.g., nucleic acid or polypeptide) present with the nucleic acid or 
polypeptide in its natural source. In one embodiment, the nucleic acid or polypeptide is found in 
the presence of (if anything) only a solvent, buffer, ion, or other component normally present in a 
solution of the same. The terms "isolated" and "purified" do not encompass nucleic acids or 

20 polypeptides present in their natural source. 

The term "recombinant," when used herein to refer to a polypeptide or protein, means 
that a polypeptide or protein is derived from recombinant (e.g., microbial, insect, or mammalian) 
expression systems. "Microbial" refers to recombinant polypeptides or proteins made in 

24 bacterial or fungal (e.g. , yeast) expression systems. As a product, "recombinant microbial" 
defines a polypeptide or protein essentially free of native endogenous substances and 
unaccompanied by associated native glycosylation. Polypeptides or proteins expressed in most 
bacterial cultures, e.g., E. colt, will be free of glycosylation modifications; polypeptides or 

28 proteins expressed in yeast will have a glycosylation pattern in general different from those 
expressed in mammalian cells. 

The term "recombinant expression vehicle or vector" refers to a plasmid or phage or virus 
or vector, for expressing a polypeptide from a DNA (RNA) sequence. An expression vehicle can 

32 comprise a transcriptional unit comprising an assembly of (1) a genetic element or elements 

having a regulatory role in gene expression, for example, promoters or enhancers, (2) a structural 
or coding sequence which is transcribed into mRNA and translated into protein, and (3) 
appropriate transcription initiation and termination sequences. Structural units intended for use 
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in yeast or eukaryotic expression systems preferably include a leader sequence enabling 
extracellular secretion of translated protein by a host cell. Alternatively, where recombinant 
protein is expressed without a leader or transport sequence, it may include an amino terminal 
4 methionine residue. This residue may or may not be subsequently cleaved from the expressed 
recombinant protein to provide a final product. 

The term "recombinant expression system" means host cells which have stably integrated 
a recombinant transcriptional unit into chromosomal DNA or carry the recombinant 
8 transcriptional unit extrachromosomally. Recombinant expression systems as defined herein will 
express heterologous polypeptides or proteins upon induction of the regulatory elements linked 
to the DNA segment or synthetic gene to be expressed. This term also means host cells which 
have stably integrated a recombinant genetic element or elements having a regulatory role in 

12 gene expression, for example, promoters or enhancers. Recombinant expression systems as 

defined herein will express polypeptides or proteins endogenous to the cell upon induction of the 
regulatory elements linked to the endogenous DNA segment or gene to be expressed. The ceils 
can be prokaryotic or eukaryotic. 

16 The term "secreted" includes a protein that is transported across or through a membrane, 

including transport as a result of signal sequences in its amino acid sequence when it is 
expressed in a suitable host cell. "Secreted" proteins include without limitation proteins secreted 
wholly (e.g., soluble proteins) or partially (e.g. , receptors) from the cell in which they are 

20 expressed. "Secreted" proteins also include without limitation proteins that are transported 
across the membrane of the endoplasmic reticulum. "Secreted" proteins are also intended to 
include proteins containing non-typical signal sequences (e.g. Interleukin-1 Beta, see Krasney, 
P.A. and Young, P.R. (1992) Cytokine 4(2):134 -143) and factors released from damaged cells 

24 (e.g. Interleukin-1 Receptor Antagonist, see Arend, W.P. et. al. (1998) Annu. Rev. Immunol. 
16:27-55) 

Where desired, an expression vector may be designed to contain a "signal or leader 
sequence" which will direct the polypeptide through the membrane of a cell. Such a sequence 

28 may be naturally present on the polypeptides of the present invention or provided from 
heterologous protein sources by recombinant DNA techniques. 

The term "stringent" is used to refer to conditions that are commonly understood in the 
art as stringent Stringent conditions can include highly stringent conditions (i.e., hybridization 

32 to filter-bound DNA in 0.5 M NaHP0 4 , 7% sodium dodecyl sulfate (SDS), 1 mM EDTA at 

65 0 C, and washing in 0.1 X SSC/0.1% SDS at 68°C), and moderately stringent conditions (i.e., 
washing in 0.2X SSC/0.1% SDS at 42°C). Other exemplary hybridization conditions axe 
described herein in the examples. 
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In instances of hybridization of deoxy oligonucleotides, additional exemplary stringent 
hybridization conditions include washing in 6X SSC/0.05% sodium pyrophosphate at 37°C (for 
14-base oligonucleotides), 48°C (for 17-base oligos), 55°C (for 20-base oligonucleotides), and 
4 60°C (for 23-base oligonucleotides). 

As used herein, "substantially equivalent" can refer both to nucleotide and amino acid 
sequences, for example a mutant sequence, that varies from a reference sequence by one or more 
substitutions, deletions, or additions, the net effect of which does not result in an adverse 
8 functional dissimilarity between the reference and subject sequences. Typically, such a 

substantially equivalent sequence varies from one of those listed herein by no more than about 
35% (z.e., the number of individual residue substitutions, additions, and/or deletions in a 
substantially equivalent sequence, as compared to the corresponding reference sequence, divided 

12 by the total number of residues in the substantially equivalent sequence is about 0.35 or less). 
Such a sequence is said to have 65% sequence identity to the listed sequence. In one 
embodiment, a substantially equivalent, e.g., mutant, sequence of the invention varies from a 
listed sequence by no more than 30% (70% sequence identity); in a variation of this 

16 embodiment, by no more than 25% (75% sequence identity); and in a further variation of this 
embodiment, by no more than 20% (80% sequence identity) and in a further variation of this 
embodiment, by no more than 10% (90% sequence identity) and in a further variation of this 
embodiment, by no more that 5% (95% sequence identity). Substantially equivalent, e.g., 

20 mutant, amino acid sequences according to the invention preferably have at least 80% sequence 
identity with a listed amino acid sequence, more preferably at least 85% sequence identity, more 
preferably at least 90% sequence identity, more preferably at least 95% identity, more preferably 
at least 98% identity, and most preferably at least 99% identity. Substantially equivalent 

24 nucleotide sequences of the invention can have lower percent sequence identities, taking into 
account, for example, the redundancy or degeneracy of the genetic code. Preferably, nucleotide 
sequence has at least about 65% identity, more preferably at least about 75% identity, more 
preferably at least about 80% sequence identity, more preferably at least about 85% sequence 

28 identity, more preferably at least about 90% sequence identity, and most preferably at least about 
95% identity, more preferably at least about 98% sequence identity, and most preferably at least 
about 99% sequence identity. For the purposes of the present invention, sequences having 
substantially equivalent biological activity and substantially equivalent expression characteristics 

32 are considered substantially equivalent. For the purposes of determining equivalence, truncation 
of the mature sequence (e.g., via a mutation which creates a spurious stop codon) should be 
disregarded. Sequence identity may be determined, e.g., using the Jotun Hein method (Hein, J. 



13 



WO 01/64835 



PCT/US01/04927 



(1990) Methods Enzymol. 183:626-645). Identity between sequences can also be determined by 
other methods known in the art, e.g. by varying hybridization conditions. 

The term "totipotent" refers to the capability of a cell to differentiate into all of the cell 
4 types of an adult organism. 

The term "transformation" means introducing DNA into a suitable host cell so that the 
DNA is replicable, either as an extrachromosomal element, or by chromosomal integration. The 
term "transfection" refers to the taking up of an expression vector by a suitable host cell, whether 
8 or not any coding sequences are in fact expressed. The term "infection" refers to the introduction 
of nucleic acids into a suitable host cell by use of a virus or viral vector. 

As used herein, an "uptake modulating fragment," UMF, means a series of nucleotides 
which mediate the uptake of a linked DNA fragment into a cell. UMFs can be readily identified 
12 using known UMFs as a target sequence or target motif with the computer-based systems 
described below. The presence and activity of a UMF can be confirmed by attaching the 
suspected UMF to a marker sequence. The resulting nucleic acid molecule is then incubated 
with an appropriate host under appropriate conditions and the uptake of the marker sequence is 
16 determined. As described above, a UMF will increase the frequency of uptake of a linked 
marker sequence. 

Each of the above terms is meant to encompass all that is described for each, unless the 
context dictates otherwise. 

20 

4.2 NUCLEIC ACIDS OF THE INVENTION 

Nucleotide sequences of the invention are set forth in the Sequence Listing. 

The isolated polynucleotides of the invention include a polynucleotide comprising the 

24 nucleotide sequences of SEQ ID NO: 1-13901 ; a polynucleotide encoding any one of the 
peptide sequences of SEQ ID NOS: 13902-27802; and a polynucleotide comprising the 
nucleotide sequence encoding the mature protein coding sequence of the polypeptides of any one 
of SEQ ID NOS: 13902-27802. The polynucleotides of the present invention also include, but 

28 are not limited to, a polynucleotide that hybridizes under stringent conditions to (a) the 
complement of any of the nucleotides sequences of SEQ ID NO: 1-13901; (b) nucleotide 
sequences encoding any one of the amino acid sequences set forth in the Sequence Listing; (c) a 
polynucleotide which is an allelic variant of any polynucleotide recited above; (d) a 

32 polynucleotide which encodes a species homolog of any of the proteins recited above; or (e) a 
polynucleotide that encodes a polypeptide comprising a specific domain or truncation of the 
polypeptides of SEQ ID NOS: 13902-27802. Domains of interest may depend on the nature of 
the encoded polypeptide; e.g., domains in receptor-like polypeptides include ligand-binding, 
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extracellular, transmembrane, or cytoplasmic domains, or combinations thereof; domains in 
immunoglobulin-like proteins include the variable immunoglobulin-like domains; domains in 
enzyme-like polypeptides include catalytic and substrate binding domains; and domains in ligand 
4 polypeptides include receptor-binding domains. 

The polynucleotides of the invention include naturally occurring or wholly or partially 
synthetic DNA, e.g., cDNA and genomic DNA, and RNA, e.g. , mRNA. The polynucleotides 
may include all of the coding region of the cDNA or may represent a portion of the coding 
8 region of the cDNA. 

The present invention also provides genes corresponding to the cDNA sequences disclosed 
herein. The corresponding genes can be isolated in accordance with known methods using the 
sequence information disclosed herein. Such methods include the preparation of probes or primers 

1 2 from the disclosed sequence information for identification and/or amplification of genes in 

appropriate genomic libraries or other sources of genomic materials. Further 5 ? and 3 r sequence can 
be obtained using methods known in the art. For example, full length cDN A or genomic DNA that 
corresponds to any of the polynucleotides of SEQ ID NO: 1-13901 can be obtained by screening 

1 6 appropriate cDNA or genomic DNA libraries under suitable hybridization conditions using any of 
the polynucleotides of SEQ ID NO : 1-13901 or a portion thereof as a probe. Alternatively, the 
polynucleotides of SEQ ID NO : 1-13901 may be used as the basis for suitable primer (s) that allow 
identification and/or amplification of genes in appropriate genomic DNA or cDNA libraries. 

20 The nucleic acid sequences of the invention can be assembled from ESTs and sequences 

(including cDNA and genomic sequences) obtained from one or more public databases, such as 
dbEST, gbpri, and UniGene. The EST sequences can provide identifying sequence information, 
representative fragment or segment information, or novel segment information for the full-length 

24 gene. 

The polynucleotides of the invention also provide polynucleotides including nucleotide 
sequences that are substantially equivalent to the polynucleotides recited above. Polynucleotides 
according to the invention can have, e.g., at least about 65%, at least about 70%, at least about 

28 75%, at least about 80%, 81%, 82%, 83%, 84%, more typically at least about 85%, 86%, 87%, 
88%, 89%, more typically at least about 90%, 91%, 92%, 93%, 94%, and even more typically at 
least about 95%, 96%, 97%, 98%, 99%, sequence identity to a polynucleotide recited above. 

Included within the scope of the nucleic acid sequences of the invention are nucleic acid 

32 sequence fragments that hybridize under stringent conditions to any of the nucleotide sequences 
of SEQ ID NO: 1-13901, or complements thereof, which fragment is greater than about 5 
nucleotides, preferably 7 nucleotides, more preferably greater than 9 nucleotides and most 
preferably greater than 17 nucleotides. Fragments of, e.g. 15, 17, or 20 nucleotides or more that 
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are selective for {i.e. specifically hybridize to any one of the polynucleotides of the invention) 
are contemplated. Probes capable of specifically hybridizing to a polynucleotide can 
differentiate polynucleotide sequences of the invention from other polynucleotide sequences in 
4 the same family of genes or can differentiate human genes from genes of other species, and are 
preferably based on unique nucleotide sequences. 

The sequences falling within the scope of the present invention are not limited to these 
specific sequences, but also include allelic and species variations thereof. Allelic and species 
8 variations can be routinely determined by comparingthe sequence provided SEQ ID NO: 1-13901, 
a representative fragment thereof, or a nucleotide sequence at least 90% identical, preferably 95% 
identical, to SEQ ID NO: 1-1 3901 with a sequence from another isolate of the same species. 
Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules 

12 coding for the same amino acid sequences as do the specific ORFs disclosed herein. In other words, 
in the coding region of an ORE, substitution of one codon for another codon that encodes the same 
amino acid is expressly contemplated. 

The nearest neighbor or homology result for the nucleic acids of the present invention, 

16 including SEQ ID NO: 1-13901 can be obtained by searching a database using an algorithm or a 
program. Preferably, a BLAST which stands for Basic Local Alignment Search Tool is used to 
search for local sequence alignments (Altshul, S.F. J Mol. Evol. 36 290-300 (1 993) and Altschul 
S.F. etal. J. Mol. Biol. 21:403-410(1990)). Alternatively a FASTA version 3 search against 

20 Genpept, using Fastxy algorithm. 

Species homologs (or orthologs) of the disclosed polynucleotides and proteins are also 
provided by the present invention. Species homologs may be isolated and identified by making 
suitable probes or primers from the sequences provided herein and screening a suitable nucleic 

24 acid source from the desired species. 

The invention also encompasses allelic variants of the disclosed polynucleotides or 
proteins; that is, naturally-occurring alternative forms of the isolated polynucleotide which also 
encode proteins which are identical, homologous or related to that encoded by the 

28 polynucleotides. 

The nucleic acid sequences of the invention are further directed to sequences which 
encode variants of the described nucleic acids. These amino acid sequence variants may be 
prepared by methods known in the art by introducing appropriate nucleotide changes into a 

32 native or variant polynucleotide. There are two variables in the construction of amino acid 

sequence variants: the location of the mutation and the nature of the mutation. Nucleic acids 
encoding the amino acid sequence variants are preferably constructed by mutating the 
polynucleotide to encode an amino acid sequence that does not occur in nature. These nucleic 
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acid alterations can be made at sites that differ in the nucleic acids from different species 
(variable positions) or in highly conserved regions (constant regions). Sites at such locations 
will typically be modified in series, e.g., by substituting first with conservative choices {e.g., 
4 hydrophobic amino acid to a different hydrophobic amino acid) and then with more distant 

choices (e.g., hydrophobic amino acid to a charged amino acid), and then deletions or insertions 
may be made at the target site. Amino acid sequence deletions generally range from about 1 to 
30 residues, preferably about 1 to 10 residues, and are typically contiguous. Amino acid 
8 insertions include amino- and/or carboxyl-terminal fusions ranging in length from one to one 
hundred or more residues, as well as intrasequence insertions of single or multiple amino acid 
residues. Intrasequence insertions may range generally from about 1 to 10 amino residues, 
preferably from 1 to 5 residues. Examples of terminal insertions include the heterologous signal 

12 sequences necessary for secretion or for intracellular targeting in different host cells and 

sequences such as FLAG or poly-histidine sequences useful for purifying the expressed protein. 

In a preferred method, polynucleotides encoding the novel amino acid sequences are 
changed via site-directed mutagenesis. This method uses' oligonucleotide sequences to alter a 

16 polynucleotide to encode the desired amino acid variant, as well as sufficient adjacent 

nucleotides on both sides of the changed amino acid to form a stable duplex on either side of the 
site of being changed. In general, the techniques of site-directed mutagenesis are well known to 
those of skill in the art and this technique is exemplified by publications such as, Edelman et al., 

20 DNA 2:1 83 (1983). A versatile and efficient method for producing site-specific changes in a 
polynucleotide sequence was published by Zoller and Smith, Nucleic Acids Res. 10:6487-6500 
(1982). PCR may also be used to create amino acid sequence variants of the novel nucleic acids. 
When small amounts of template DNA are used as starting material, primer(s) that differs 

24 slightly in sequence from the corresponding region in the template DNA can generate the desired 
amino acid variant. PCR amplification results in a population of product DNA fragments that 
differ from the polynucleotide template encoding the polypeptide at the position specified by the 
•primer. The product DNA fragments replace the corresponding region in the plasmid and this 

28 gives a polynucleotide encoding the desired amino acid variant. 

A further technique for generating amino acid variants is the cassette mutagenesis 
technique described in Wells et ah, Gene 34:315 (1985); and other mutagenesis techniques well 
known in the art, such as, for example, the techniques in Sambrook et ah, supra, and Current 

32 Protocols in Molecular Biology, Ausubel et al. Due to the inherent degeneracy of the genetic 
code, other DNA sequences which encode substantially the same or a functionally equivalent 
amino acid sequence may be used in the practice of the invention for the cloning and expression 



17 



WO 01/64835 



PCT/US01/04927 



of these novel nucleic acids. Such DNA sequences include those which are capable of 
hybridizing to the appropriate novel nucleic acid sequence under stringent conditions. 

Polynucleotides encoding preferred polypeptide truncations of the invention can be used 
4 to generate polynucleotides encoding chimeric or fusion proteins comprising one or more 
domains of the invention and heterologous protein sequences. 

The polynucleotides of the invention additionally include the complement of any of the 
polynucleotides recited above. The polynucleotide can be DNA (genomic, cDNA, amplified, or 
8 synthetic) or RNA. Methods and algorithms for obtaining such polynucleotides are well known 
to those of skill in the art and can include, for example, methods for determining hybridization 
conditions that can routinely isolate polynucleotides of the desired sequence identities. 

In accordance with the invention, polynucleotide sequences comprising the mature 

12 protein coding sequences corresponding to any one of SEQ ID NO: 1-13901 , or functional 
equivalents thereof, may be used to generate recombinant DNA molecules that direct the 
expression of that nucleic acid, or a functional equivalent thereof, in appropriate host cells. Also 
included are the cDNA inserts of any of the clones identified herein. 

16 A polynucleotide according to the invention can be joined to any of a variety of other 

nucleotide sequences by well-established recombinant DNA techniques (see Sambrook J et al. 
(1989) Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, NY). Useful 
nucleotide sequences for joining to polynucleotides include an assortment of vectors, e.g., 

20 plasmids, cosmids, lambda phage derivatives, phagemids, and the like, that are well known in the 
art. Accordingly, the invention also provides a vector including a polynucleotide of the 
invention and a host cell containing the polynucleotide. In general, the vector contains an origin 
of replication functional in at least one organism, convenient restriction endonuclease sites, and a 

24 selectable marker for the host cell. Vectors according to the invention include expression 
vectors, replication vectors, probe generation vectors, and sequencing vectors. A host cell 
according to the invention can be a prokaryotic or eukaryotic cell and can be a unicellular 
organism or part of a multicellular organism. 

28 The present invention further provides recombinant constructs comprising a nucleic acid 

having any of the nucleotide sequences of SEQ ID NO: 1-13901 or a fragment thereof or any 
other polynucleotides of the invention. In one embodiment, the recombinant constructs of the 
present invention comprise a vector, such as a plasmid or viral vector, into which a nucleic acid 

32 having any of the nucleotide sequences of SEQ ID NO: 1-13901 or a fragment thereof is 

inserted, in a forward or reverse orientation. In the case of a vector comprising one of the ORFs 
of the present invention, the vector may further comprise regulatory sequences, including for 
example, a promoter, operably linked to the ORF. Large numbers of suitable vectors and 
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promoters are known to those of skill in the art and are commercially available for generating the 
recombinant constructs of the present invention. The following vectors are provided by way of 
example. Bacterial: pBs, phagescript, PsiX174, pBluescript SK, pBs KS, pNH8a, pNH16a, 
4 pNHl 8a, pNH46a (Stratagene); P Trc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (Pharmacia). 
Eukaryotic: pWLneo, pSV2cat, pOG44, PXTI, pSG (Stratagene) pSVK3, pBPV, pMSG, pSVL 
(Pharmacia). 

The isolated polynucleotide of the invention may be operably linked to an expression 
8 control sequence such as the pMT2 or pED expression vectors disclosed in Kaufman et al, 

Nucleic Acids Res. 1 9 9 4485-4490 (1 991), in order to produce the protein recombinantly. Many 
suitable expression control sequences are known in the art. General methods of expressing 
recombinant proteins are also known and are exemplified in R. Kaufman, Methods in 

12 Enzymology 185, 537-566 (1990). As defined herein "operably linked" means that the isolated 
polynucleotide of the invention and an expression control sequence are situated within a vector 
or cell in such a way that the protein is expressed by a host cell which has been transformed 
(transfected) with the ligated polynucleotide/expression control sequence. 

16 Promoter regions can be selected from any desired gene using CAT (chloramphenicol 

transferase) vectors or other vectors with selectable markers. Two appropriate vectors are 
pKK232-8 and pCM7. Particular named bacterial promoters include lad, lacZ, T3, T7, gpt, 
lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV thymidine 

20 kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of 
the appropriate vector and promoter is well within the level of ordinary skill in the art. 
Generally, recombinant expression vectors will include origins of replication and selectable 
markers permitting transformation of the host cell, e.g., the ampicillin resistance gene of E. coli 

24 and S. cerevisiae TRP1 gene, and a promoter derived from a highly-expressed gene to direct 

transcription of a downstream structural sequence. Such promoters can be derived from operons 
encoding glycolytic enzymes such as 3 -phosphogly cerate kinase (PGK), a-factor, acid 
phosphatase, or heat shock proteins, among others. The heterologous structural sequence is 

28 assembled in appropriate phase with translation initiation and termination sequences, and 
preferably, a leader sequence capable of directing secretion of translated protein into the 
periplasmic space or extracellular medium. Optionally, the heterologous sequence can encode a 
fusion protein including an amino terminal identification peptide imparting desired 

32 characteristics, e.g. , stabilization or simplified purification of expressed recombinant product. 
Useful expression vectors for bacterial use are constructed by inserting a structural DNA 
sequence encoding a desired protein together with suitable translation initiation and termination 
signals in operable reading phase with a functional promoter. The vector will comprise one or 
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more phenotypic selectable markers and an origin of replication to ensure maintenance of the 
vector and to, if desirable, provide amplification within the host. Suitable prokaryotic hosts for 
transformation include E. coli, Bacillus subtilis, Salmonella typhimurium and various species 
4 within the genera Pseudomonas, Sfreptomyces, and Staphylococcus, although others may also be 
employed as a matter of choice. 

As a representative but non-limiting example, useful expression vectors for bacterial use 
can comprise a selectable marker and bacterial origin of replication derived from commercially 
8 available plasmids comprising genetic elements of the well known cloning vector pBR322 
(ATCC 37017). Such commercial vectors include, for example, pKK223-3 (Pharmacia Fine 
Chemicals, Uppsala, Sweden) and GEM 1 (Promega Biotech, Madison, WI, USA). These 
pBR322 "backbone" sections are combined with an appropriate promoter and the structural 

12 sequence to be expressed. Following transformation of a suitable host strain and growth of the 
host strain to an appropriate cell density, the selected promoter is induced or derepressed by 
appropriate means (e.g., temperature shift or chemical induction) and cells are cultured for an 
additional period. Cells are typically harvested by centrifogation, disrupted by physical or 

1 6 chemical means, and the resulting crude extract retained for further purification. 

Polynucleotides of the invention can also be used to induce immune responses. For 
example, as described in Fan et aL, Nat Biotech. 17:870-872 (1999), incorporated herein by 
reference, nucleic acid sequences encoding a polypeptide may be used to generate antibodies 

20 against the encoded polypeptide following topical administration of naked plasmid DNA or 
following injection, and preferably intramuscular injection of the DNA. The nucleic acid 
sequences are preferably inserted in a recombinant expression vector and may be in the form of 
naked DNA. 

24 

4.3 ANTISENSE 

Another aspect of the invention pertains to isolated antisense nucleic acid molecules that 
are hybridizable to or complementary to the nucleic acid molecule comprising the nucleotide 

28 sequence of SEQ ID NO: 1-13901 , or fragments, analogs or derivatives thereof. An "antisense" 
nucleic acid comprises a nucleotide sequence that is complementary to a "sense" nucleic acid 
encoding a protein, e.g., complementary to the coding strand of a double-stranded cDNA 
molecule or complementary to an mRNA sequence. In specific aspects, antisense nucleic acid 

32 molecules are provided that comprise a sequence complementary to at least about 10, 25, 50, 

100, 250 or 500 nucleotides or an entire coding strand, or to only a portion thereof. Nucleic acid 
molecules encoding fragments, homologs, derivatives and analogs of a protein of any of SEQ ID 
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NOS: 13902-27802 or antisense nucleic acids complementary to a nucleic acid sequence of SEQ 
ID NO: 1-13901 are additionally provided. 

In one embodiment, an antisense nucleic acid molecule is antisense to a "coding region" 
4 of the coding strand of a nucleotide sequence of the invention. The term "coding region" refers 
to the region of the nucleotide sequence comprising codons which are translated into amino acid 
residues. In another embodiment, the antisense nucleic acid molecule is antisense to a 
"noncoding region" of the coding strand of a nucleotide sequence of the invention. The term 
8 "noncoding region" refers to 5' and 3' sequences which flank the coding region that are not 
translated into amino acids (i.e., also referred to as 5 T and 3' untranslated regions). 

Given the coding strand sequences encoding a nucleic acid disclosed herein (e.g. , SEQ 
ID NO: 1-13901 ), antisense nucleic acids of the invention can be designed according to the rules 

12 of Watson and Crick or Hoogsteen base pairing. The antisense nucleic acid molecule can be 

complementary to the entire coding region of a mRNA, but more preferably is an oligonucleotide 
that is antisense to only a portion of the coding or noncoding region of a mRNA. For example, 
the antisense oligonucleotide can be complementary to the region surrounding the translation 

16 start site of a mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 
30, 35, 40, 45 or 50 nucleotides in length. An antisense nucleic acid of the invention can be 
constructed using chemical synthesis or enzymatic ligation reactions using procedures known in 
the art. For example, an antisense nucleic acid (e.g., an antisense oligonucleotide) can be 

20 chemically synthesized using naturally occurring nucleotides or variously modified nucleotides 
designed to increase the biological stability of the molecules or to increase the physical stability 
of the duplex formed between the antisense and sense nucleic acids, e.g., phosphorothioate 
derivatives and acridine substituted nucleotides can be used. 

24 Examples of modified nucleotides that can be used to generate the antisense nucleic acid 

include: 5-fluorouracil, 5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 
4-acetylcytosine, 5 -(carboxyhy droxy lmethyl) uracil, 5-carboxymethylaminomethyl- 
2-thiouridine, 5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, 

28 inosine, N6-isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 

2- methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, 
beta-D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 

32 2-methylthio-N6-isopentenyladenine, uracil-5-oxy acetic acid (v), wybutoxosine, pseudouracil, 
queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, 
uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5~methyl-2-thiouracil, 

3- (3-amino-3-N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the 



21 



WO 01/64835 PCT/US01/04927 

antisense nucleic acid can be produced biologically using an expression vector into which a 
nucleic acid has been subcloned in an antisense orientation (Le., RNA transcribed from the 
inserted nucleic acid will be of an antisense orientation to a target nucleic acid of interest, 
4 described further in the following subsection). 

The antisense nucleic acid molecules of the invention are typically administered to a 
subject or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding a protein according to the invention to thereby inhibit expression of the 
8 protein, e.g. , by inhibiting transcription and/or translation. The hybridization can be by 

conventional nucleotide complementarity to form a stable duplex, or, for example, in the case of 
an antisense nucleic acid molecule that binds to DNA duplexes, through specific interactions in 
the major groove of the double helix. An example of a route of administration of antisense 

12 nucleic acid molecules of the invention includes direct injection at a tissue site. Alternatively, 
antisense nucleic acid molecules can be modified to target selected cells and then administered 
systemically. For example, for systemic administration, antisense molecules can be modified 
such that they specifically bind to receptors or antigens expressed on a selected cell surface, e.g., 

16 by linking the antisense nucleic acid molecules to peptides or antibodies that bind to cell surface 
receptors or antigens. The antisense nucleic acid molecules can also be delivered to cells using 
the vectors described herein. To achieve sufficient intracellular concentrations of antisense 
molecules, vector constructs in which the antisense nucleic acid molecule is placed under the 

20 control of a strong pol II or pol III promoter are preferred. 

In yet another embodiment, the antisense nucleic acid molecule of the invention is an 
-a n omeric nucleic acid molecule. An -a nomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual -uni ts, the 

24 strands run parallel to each other (Gaultier et ah (1987) Nucleic Acids Res 15: 6625-6641). The 
antisense nucleic acid molecule can also comprise a 2 f -o-methylribonucleotide (Inoue et ah 
(1987) Nucleic Acids Res 15: 6131-6148) or a chimeric RNA -DNA analogue (Inoue et ah (1987) 
FEES Lett 215: 327-330). 

28 

4.4 RIBOZYMES AND PNA MOIETIES 

In still another embodiment, an antisense nucleic acid of the invention is a ribozyme. 
Ribozymes are catalytic RNA molecules with ribonuclease activity that are capable of cleaving a 
32 single-stranded nucleic acid, such as a mRNA, to which they have a complementary region. 
Thus, ribozymes (e.g., hammerhead ribozymes (described in Haselhoff and Gerlach (1988) 
Nature 334:585-591)) can be used to catalytically cleave a mRNA transcripts to thereby inhibit 
translation of a mRNA. A ribozyme having specificity for a nucleic acid of the invention can be 
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designed based upon the nucleotide sequence of a DNA disclosed herein (i.e., SEQ ID NO: 1- 
13901 ). For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in 
which the nucleotide sequence of the active site is complementary to the nucleotide sequence to 
4 be cleaved in a SECX-encoding mKNA. See, e.g. , Cech et ah U.S. Pat No. 4,987,071 ; and 
Cech et al U.S. Pat. No. 5,1 16,742. Alternatively, SECX rnRNA can be used to select a 
catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules. See, e.g., 
Bartel et ah, (1993) Science 261:1411-1418. 
8 Alternatively, gene expression can be inhibited by targeting nucleotide sequences 

complementary to the regulatory region (e.g., promoter and/or enhancers) to form triple helical 
structures that prevent transcription of the gene in target cells. See generally, Helene. (1991) 
Anticancer Drug Des. 6: 569-84; Helene. et al. (1992) Ann. N. Y. Acad Set 660:27-36; and 

12 Maher (1992) Bioassays 14: 807-1 5. 

In various embodiments, the nucleic acids of the invention can be modified at the base 
moiety, sugar moiety or phosphate backbone to improve, e.g., the stability, hybridization, or 
solubility of the molecule. For example, the deoxyribose phosphate backbone of the nucleic 

16 acids can be modified to generate peptide nucleic acids (see Hyrup et al. (1996) Bioorg Med 

Chem 4: 5-23). As used herein, the terms "peptide nucleic acids" or "PNAs" refer to nucleic acid 
mimics, e.g., DNA mimics, in which the deoxyribose phosphate backbone is replaced by a 
pseudopeptide backbone and only the four natural nucleobases are retained. The neutral 

20 backbone of PNAs has been shown to allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. The synthesis of PNA oligomers can be performed using 
standard solid phase peptide synthesis protocols as described in Hyrup et al. (1996) above; 
Perry-O'Keefee/tf/. (1 996) PNAS 93: 14670-675. 

24 PNAs of the invention can be used in therapeutic and diagnostic applications. For 

example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of 
gene expression by, e.g., inducing transcription or translation arrest or inhibiting replication. 
PNAs of the invention can also be used, e.g., in the analysis of single base pair mutations in a 

28 gene by, e.g., PNA directed PCR clamping; as artificial restriction enzymes when used in 

combination with other enzymes, e.g., SI nucleases (Hyrup B. (1996) above); or as probes or 
primers for DNA sequence and hybridization (Hyrup et ah (1996), above; Perry-O'Keefe (1996), 
above). 

32 In another embodiment, PNAs of the invention can be modified, e.g., to enhance their 

stability or cellular uptake, by attaching lipophilic or other helper groups to PNA, by the 
formation of PNA-DNA chimeras, or by the use of liposomes or other techniques of drug 
delivery known in the art. For example, PNA-DNA chimeras can be generated that may 
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combine the advantageous properties of PNA and DNA. Such chimeras allow DNA recognition 
enzymes, e.g., RNase H and DNA polymerases, to interact with the DNA portion while the PNA 
portion would provide high binding affinity and specificity. PNA-DNA chimeras can be linked 

4 using linkers of appropriate lengths selected in terms of base stacking, number of bonds between 
the nucleobases, and orientation (Hyrap (1996) above). The synthesis of PNA-DNA chimeras 
can be performed as described in Hyrup (1996) above and Finn et al. (1996) Nucl Acids Res 24: 
3357-63. For example, a DNA chain can be synthesized on a solid support using standard 

8 phosphoramidite coupling chemistry, and modified nucleoside analogs, e.g., 

5'-(4-methoxytrityl)amino-5 r -deoxy-thymidine phosphoramidite, can be used between the PNA 
and the 5' end of DNA (Mag et al (1989) Nucl Acid Res 17: 5973-88). PNA monomers are then 
coupled in a stepwise manner to produce a chimeric molecule with a 5' PNA segment and a 3' 
12 DNA segment (Finn et al. (1 996) above). Alternatively, chimeric molecules can be synthesized 
with a 5' DNA segment and a 3' PNA segment. See, Petersen et ah (1975) BioorgMed Chem 
LettS: 1119-11124. 

In other embodiments, the oligonucleotide may include other appended groups such as 
16 peptides {e.g., for targeting host cell receptors in vivo), or agents facilitating transport across the 

cell membrane (see, e.g., Letsinger et al, 1989, Proc. Natl Acad. Sci. U.S.A. 86:6553-6556; 

Lemaitre et al, 1987, Proc Natl Acad. Set. 84:648-652; PCT Publication No. W088/09810) or 

the blood-brain barrier (see, e.g., PCT Publication No. W089/10134). In addition, 
20 oligonucleotides can be modified with hybridization triggered cleavage agents (See, e.g. , Krol et 

al., 1988, BioTechniques 6:958-976) or intercalating agents. (See, e.g., Zon, 1988, Pharm. Res. 

5: 539-549). To this end, the oligonucleotide may be conjugated to another molecule, e.g., a. 

peptide, a hybridization triggered cross-linking agent, a transport agent, a hybridization-triggered 
24 cleavage agent, etc. 

4.5 HOSTS 

The present invention further provides host cells genetically engineered to contain the 
28 polynucleotides of the invention. For example, such host cells may contain nucleic acids of the 
invention introduced into the host cell using known transformation, transfection or infection 
methods. The present invention still further provides host cells genetically engineered to express 
the polynucleotides of the invention, wherein such polynucleotides are in operative association 
32 yvith a regulatory sequence heterologous to the host cell which drives expression of the 
polynucleotides in the cell. 

Knowledge of nucleic acid sequences allows for modification of cells to permit, or 
increase, expression of endogenous polypeptide. Cells can be modified (e.g. , by homologous 
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recombination) to provide increased polypeptide expression by replacing, in whole or in part, the 
naturally occurring promoter with all or part of a heterologous promoter so that the cells express 
the polypeptide at higher levels. The heterologous promoter is inserted in such a manner that it 
4 is operatively linked to the encoding sequences. See, for example, PCT International Publication 
No. WO94/12650, PCT International Publication No. WO92/20808, and PCT International 
Publication No. WO91/09955. It is also contemplated that, in addition to heterologous promoter 
DNA, amplifiable marker DNA (e.g., ada, dhfr, and the multifunctional CAD gene which 
8 encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 
intron DNA may be inserted along with the heterologous promoter DNA. If linked to the coding 
sequence, amplification of the marker DNA by standard selection methods results in co- 
amplification of the desired protein coding sequences in the cells. 

12 The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower 

eukaryotic host cell, such as a yeast cell, or the host cell can be a prokaryotic cell, such as a 
bacterial cell. Introduction of the recombinant construct into the host cell can be effected by 
calcium phosphate transfection, DEAE, dextran mediated transfection, or electroporation (Davis, 

16 L. et al., Basic Methods in Molecular Biology (1986)). The host cells containing one of the 
polynucleotides of the invention, can be used in conventional manners to produce the gene 
product encoded by the isolated fragment (in the case of an ORF) or can be used to produce a 
heterologous protein under the control of the EMF. 

20 Any host/vector system can be used to express one or more of the ORFs of the present 

invention. These include, but are not limited to, eukaryotic hosts such as HeLa cells, Cv-1 cell, 
COS cells, 293 cells, and Sf9 cells, as well as prokaryotic host such as E. coli and B. subtilis. 
The most preferred cells are those which do not normally express the particular polypeptide or 

24 protein or which expresses the polypeptide or protein at low natural level. Mature proteins can 
be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate 
promoters. Cell-free translation systems can also be employed to produce such proteins using • 
RNAs derived from the DNA constructs of the present invention. Appropriate cloning and 

28 expression vectors for use with prokaryotic and eukaryotic hosts are described by Sambrook, et 
al., in Molecular Cloning: A Laboratory Manual, Second Edition, Cold Spring Harbor, New 
York (1989), the disclosure of which is hereby incorporated by reference. 

Various mammalian cell culture systems can also be employed to express recombinant 

32 protein. Examples of mammalian expression systems include the COS-7 lines of monkey kidney 
fibroblasts, described by Gluzman, Cell 23:175 (1981). Other cell lines capable of expressing a 
compatible vector are, for example, the C127, monkey COS cells, Chinese Hamster Ovary 
(CHO) cells, human kidney 293 cells, human epidermal A431 cells, human Colo205 cells, 3T3 
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cells, CV-1 cells, other transformed primate cell lines, normal diploid cells, cell strains derived 
from in vitro culture of primary tissue, primary explants, HeLa cells, mouse L cells, BHK ? 
HL-60, U937, HaK or Jurkat cells. Mammalian expression vectors will comprise an origin of 

4 replication, a suitable promoter and also any necessary ribosome binding sites, polyadenylation 
site, splice donor and acceptor sites, transcriptional termination sequences, and 5 r flanking 
nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for example, 
SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide 

8 the required nontranscribed genetic elements. Recombinant polypeptides and proteins produced 
in bacterial culture are usually isolated by initial extraction from cell pellets, followed by one or 
more salting-out, aqueous ion exchange or size exclusion chromatography steps. Protein 
refolding steps can be used, as necessary, in completing configuration of the mature protein. 
12 Finally, high performance liquid chromatography (HPLC) can be employed for final purification 
steps. Microbial cells employed in expression of proteins can be disrupted by any convenient 
method, including freeze-thaw cycling, sonication, mechanical disruption, or use of cell lysing 
agents. 

16 Alternatively, it may be possible to produce the protein in lower eukaryotes such as yeast 

or insects or in prokaryotes such as bacteria. Potentially suitable yeast strains include 
Saccharomyces cerevisiae, Schizosaccharomyces pombe, Kluyveromyces strains, Candida, or 
any yeast strain capable of expressing heterologous proteins. Potentially suitable bacterial 

20 strains include Escherichia coli, Bacillus subtilis, Salmonella typhimurium, or any bacterial 

strain capable of expressing heterologous proteins. If the protein is made in yeast or bacteria, it 
may be necessary to modify the protein produced therein, for example by phosphorylation or 
glycosylation of the appropriate sites, in order to obtain the functional protein. Such covalent 

24 attachments may be accomplished using known chemical or enzymatic methods. 

In another embodiment of the present invention, cells and tissues may be engineered to 
express an endogenous gene comprising the polynucleotides of the invention under the control of 
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene 

28 may be replaced by homologous recombination. As described herein, gene targeting can be used 
to replace a gene's existing regulatory region with a regulatory sequence isolated from a different 
gene or a novel regulatory sequence synthesized by genetic engineering methods. Such 
regulatory sequences may be comprised of promoters, enhancers, scaffold-attachment regions, 

32 negative regulatory elements, transcriptional initiation sites, regulatory protein binding sites or 
combinations of said sequences. Alternatively, sequences which affect the structure or stability 
of the RNA or protein produced may be replaced, removed, added, or otherwise modified by 
targeting. These sequence include polyadenylation signals, mRNA stability elements, splice 
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sites, leader sequences for enhancing or modifying transport or secretion properties of the 
protein, or other sequences which alter or improve the function or stability of protein or RNA 
molecules, 

A The targeting event may be a simple insertion of the regulatory sequence, placing the 

gene under the control of the new regulatory sequence, e.g., inserting a new promoter or 
enhancer or both upstream of a gene. Alternatively, the targeting event may be a simple deletion 
of a regulatory element, such as the deletion of a tissue-specific negative regulatory element. 
8 Alternatively, the targeting event may replace an existing element; for example, a tissue-specific 
enhancer can be replaced by an enhancer that has broader or different cell-type specificity than 
the naturally occurring elements. Here, the naturally occurring sequences are deleted and new 
sequences are added. In all cases, the identification of the targeting event may be facilitated by 

12 the use of one or more selectable marker genes that are contiguous with the targeting DNA, 

allowing for the selection of cells in which the exogenous DNA has integrated into the host cell 
genome. The identification of the targeting event may also be facilitated by the use of one or 
more marker genes exhibiting the property of negative selection, such that the negatively 

16 selectable marker is linked to the exogenous DNA, but configured such that the negatively 
selectable marker flanks the targeting sequence, and such that a correct homologous 
recombination event with sequences in the host cell genome does not result in the stable 
integration of the negatively selectable marker. Markers useful for this purpose include the 

20 Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial xanthine-guanine 
phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with 
this aspect of the invention are more particularly described in U.S. Patent No. 5,272,071 to 

24 Chappel; U.S. Patent No. 5,578,461 to Sherwin et al.; International Application No. 
PCT/US92/09627 (WO93/09222) by Selden et al.; and International Application No. 
PCT/US90/06436 (WO91/06667) by Skoultchi et al., each of which is incorporated by reference 
herein in its entirety. 

28 

4.6 POLYPEPTIDES OF THE INVENTION 

The isolated polypeptides of the invention include, but are not limited to, a polypeptide 
comprising: the amino acid sequences set forth as any one of SEQ ID NOS: 13902-278Q2 or an 
32 amino acid sequence encoded by any one of the nucleotide sequences SEQ ID NO: 1-13901 or 
the corresponding full length or mature protein. Polypeptides of the invention also include 
polypeptides preferably with biological or immunological activity that are encoded by: (a) a 
polynucleotide having any one of the nucleotide sequences set forth in SEQ ID NO: 1-13901 or 
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(b) polynucleotides encoding any one of the amino acid sequences set forth as SEQ ID NOS: 
13902-27802 or (c) polynucleotides that hybridize to the complement of the polynucleotides of 
either (a) or (b) under stringent hybridization conditions. The invention also provides 
4 biologically active or immunologically active variants of any of the amino acid sequences set 
forth as SEQ ID NOS: 13902-27802 or the corresponding full length or mature protein; and 
"substantial equivalents" thereof (e.g., with at least about 65%, at least about 70%, at least about 
75%, at least about 80%, at least about 85%, 86% ? 87%, 88%, 89%, at least about 90%, 91%, 
8 92%, 93%, 94%, typically at least about 95%, 96%, 97%, more typically at least about 98%, or 
most typically at least about 99% amino acid identity) that retain biological activity. 
Polypeptides encoded by allelic variants may have a similar, increased, or decreased activity 
compared to polypeptides comprising SEQ ID NOS: 13902-27802. 

12 Fragments of the proteins of the present invention which are capable of exhibiting 

biological activity are also encompassed by the present invention. Fragments of the protein may 
be in linear form or they may be cyclized using known methods, for example, as described inH. 
U. Saragovi, et aL, Bio/Technology 10, 773-778 (1992) and in R. S. McDowell, et al., J. Amer. 

16 Chem. Soc. 1 14, 9245-9253 (1 992), both of which are incorporated herein by reference. Such 
fragments may be fused to carrier molecules such as immunoglobulins for many purposes, 
including increasing the valency of protein binding sites. 

The present invention also provides both fall-length and mature forms (for example, 

20 without a signal sequence or precursor sequence) of the disclosed proteins. The protein coding 
sequence is identified in the sequence listing by translation of the disclosed nucleotide 
sequences. The mature form of such protein may be obtained by expression of a full-length 
polynucleotide in a suitable mammalian cell or other host cell. The sequence of the mature form 

24 of the protein is also determinable from the amino acid sequence of the full-length form. Where 
proteins of the present invention are membrane bound, soluble forms of the proteins are also 
provided. In such forms, part or all of the regions causing the proteins to be membrane bound 
are deleted so that the proteins are fully secreted from the cell in which they are expressed. 

28 Protein compositions of the present invention may further comprise an acceptable carrier, 

such as a hydrophilic, e.g., pharmaceutical^ acceptable, carrier. 

The present invention further provides isolated polypeptides encoded by the nucleic acid 
fragments of the present invention or by degenerate variants of the nucleic acid fragments of the 

32 present invention. By "degenerate variant" is intended nucleotide fragments which differ from a 
nucleic acid fragment of the present invention (e.g. , an ORF) by nucleotide sequence but, due to 
the degeneracy of the genetic code, encode an identical polypeptide sequence. Preferred nucleic 
acid fragments of the present invention are the ORFs that encode proteins. 
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A variety of methodologies known in the art can be utilized to obtain any one of the 
isolated polypeptides or proteins of the present invention. At the simplest level, the amino acid 
sequence can be synthesized using commercially available peptide synthesizers. The 
synthetically-constructed protein sequences, by virtue of sharing primary, secondary or tertiary 
structural and/or conformational characteristics with proteins may possess biological properties 
in common therewith, including protein activity. This technique is particularly useful in 
producing small peptides and fragments of larger polypeptides. Fragments are useful, for 
example, in generating antibodies against the native polypeptide. Thus, they may be employed 
as biologically active or immunological substitutes for natural, purified proteins in screening of 
therapeutic compounds and in immunological processes for the development of antibodies. 

The polypeptides and proteins of the present invention can alternatively be purified from 
cells which have been altered to express the desired polypeptide or protein. As used herein, a 
cell is said to be altered to express a desired polypeptide or protein when the cell, through genetic 
manipulation, is made to produce a polypeptide or protein which it normally does not produce or 
which the cell normally produces at a lower level. One skilled in the art can readily adapt 
procedures for introducing and expressing either recombinant or synthetic sequences into 
eukaryotic or prokaryotic cells in order to generate a cell which produces one of the polypeptides 
or proteins of the present invention. 

The invention also relates to methods for producing a polypeptide comprising growing a 
culture of host cells of the invention in a suitable culture medium, and purifying the protein from 
the cells or the culture in which the cells are grown. For example, the methods of the invention 
include a process for producing a polypeptide in which a host cell containing a suitable 
expression vector that includes a polynucleotide of the invention is cultured under conditions that 
allow expression of the encoded polypeptide. The polypeptide can be recovered from the 
culture, conveniently from the culture medium, or from a lysate prepared from the host cells and 
further purified. Preferred embodiments include those in which the protein produced by such 
process is a full length or mature form of the protein. 

In an alternative method, the polypeptide or protein is purified from bacterial cells which 
naturally produce the polypeptide or protein. One skilled in the art can readily follow known 
methods for isolating polypeptides and proteins in order to obtain one of the isolated 
polypeptides or proteins of the present invention. These include, but are not limited to, 
immuno chromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, 
and immuno-affinity chromatography. See, e.g., Scopes, Protein Purification: Principles and 
Practice, Springer- Verlag (1994); Sambrook, et al., in Molecular Cloning: A Laboratory 
Manual', Ausubel et al., Current Protocols in Molecular Biology, Polypeptide fragments that 
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retain biological/immunological activity include fragments comprising greater than about 100 
amino acids, or greater than about 200 amino acids, and fragments that encode specific protein 
domains. 

4 The purified polypeptides can be used in in vitro binding assays which are well known in 

the art to identify molecules which bind to the polypeptides. These molecules include but are not 
limited to, for e.g., small molecules, molecules from combinatorial libraries, antibodies or other 
proteins. The molecules identified in the binding assay are then tested for antagonist or agonist 
8 activity in in vivo tissue culture or animal models that are well known in the art. In brief, the 
molecules are titrated into a plurality of cell cultures or animals and then tested for either 
cell/animal death or prolonged survival of the animal/cells. 

In addition, the peptides of the invention or molecules capable of binding to the peptides 

12 may be complexed with toxins, e.g., ricin or cholera, or with other compounds that are toxic to 
cells. The toxin-binding molecule complex is then targeted to a tumor or other cell by the 
specificity of the binding molecule for SEQ IDNOS: 13902-27802. 

The protein of the invention may also be expressed as a product of transgenic animals, 

16 e.g., as a component of the milk of transgenic cows, goats, pigs, or sheep which are characterized 
by somatic or germ cells containing a nucleotide sequence encoding the protein. 

The proteins provided herein also include proteins characterized by amino acid sequences 
similar to those of purified proteins but into which modification are naturally provided or 

20 deliberately engineered. For example, modifications, in the peptide or DNA sequence, can be 

made by those skilled in the art using known techniques. Modifications of interest in the protein 
sequences may include the alteration, substitution, replacement, insertion or deletion of a 
selected amino acid residue in the coding sequence. For example, one or more of the cysteine 

24 residues may be deleted or replaced with another amino acid to alter the conformation of the 
molecule. Techniques for such alteration, substitution, replacement, insertion or deletion are 
well known to those skilled in the art (see, e.g., U.S. Pat. No. 4,518,584). Preferably, such 
alteration, substitution, replacement, insertion or deletion retains the desired activity of the 

28 protein. Regions of the protein that are important for the protein function can be determined by 
various methods known in the art including the alanine-scanning method which involved 
systematic substitution of single or strings of amino acids with alanine, followed by testing the 
resulting alanine-containing variant for biological activity. This type of analysis determines the 

32 importance of the substituted amino acid(s) in biological activity. Regions of the protein that are 
important for protein function may be determined by the eMATRIX program. 

Other fragments and derivatives of the sequences of proteins which would be expected to 
retain protein activity in whole or in part and are useful for screening or other immunological 
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methodologies may also be easily made by those skilled in the art given the disclosures herein. 
Such modifications are encompassed by the present invention. 

The protein may also be produced by operably linking the isolated polynucleotide of the 
4 invention to suitable control sequences in one or more insect expression vectors, and employing 
an insect expression system. Materials and methods for baculovirus/insect cell expression 
systems are commercially available in kit form from, e.g., Invitrogen, San Diego, Calif, U.S.A. 
(the MaxBat™ kit), and such methods are well known in the art, as described in Summers and 
8 Smith, Texas Agricultural Experiment Station Bulletin No. 1555 (1987), incorporated herein by 
reference. As used herein, an insect cell capable of expressing a polynucleotide of the present 
invention is "transformed." 

The protein of the invention may be prepared by culturing transformed host cells under 

12 culture conditions suitable to express the recombinant protein. The resulting expressed protein 
may then be purified from such culture (i.e., from culture medium or cell extracts) using known 
purification processes, such as gel filtration and ion exchange chromatography. The purification 
of the protein may also include an affinity column containing agents which will bind to the 

16 protein; one or more column steps over such affinity resins as concanavalin A-agarose, 
heparin-toyopearl™ or Cibacrom blue 3GA Sepharose™; one or more steps involving 
hydrophobic interaction chromatography using such resins as phenyl ether, butyl ether, or propyl 
ether; or immunoaffinity chromatography. 

20 Alternatively, the protein of the invention may also be expressed in a form which will 

facilitate purification. For example, it may be expressed as a fusion protein, such as those of 
maltose binding protein (MBP), glutathione-S-transferase (GST) or thioredoxin (TRX), or as a 
His tag. Kits for expression .and purification of such fusion proteins are commercially available 

24 from New England BioLab (Beverly, Mass.), Pharmacia (Piscataway, N.J.) and Invitrogen, 

respectively. The protein can also be tagged with an epitope and subsequently purified by using 
a specific antibody directed to such epitope. One such epitope ("FLAG®") is commercially 
available from Kodak (New Haven, Conn.). 

28 Finally, one or more reverse-phase high performance liquid chromatography (RP- HPLC) 

steps employing hydrophobic RP-HPLC media, e.g., silica gel having pendant methyl or other 
aliphatic groups, can be employed to further purify the protein. Some or all of the foregoing 
purification steps, in various combinations, can also be employed to provide a substantially 

32 homogeneous isolated recombinant protein. The protein thus purified is substantially free of 

other mammalian proteins and is defined in accordance with the present invention as an "isolated 
protein." 
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The polypeptides of the invention include analogs (variants). This embraces fragments, 
as well as peptides in which one or more amino acids has been deleted, inserted, or substituted. 
Also, analogs of the polypeptides of the invention embrace fusions of the polypeptides or 

4 modifications of the polypeptides of the invention, wherein the polypeptide or analog is fused to 
another moiety or moieties, e.g. , targeting moiety or another therapeutic agent. Such analogs 
may exhibit improved properties such as activity and/or stability. Examples of moieties which 
may be fused to the polypeptide or an analog include, for example, targeting moieties which 

8 provide for the delivery of polypeptide to pancreatic cells, e.g. , antibodies to pancreatic cells, 

antibodies to immune cells such as T-cells, monocytes, dendritic cells, granulocytes, etc., as well 
as receptor and ligands expressed on pancreatic or immune cells. Other moieties which may be 
fused to the polypeptide include therapeutic agents which are used for treatment, for example, 
12 immunosuppressive drugs such as cyclosporin, SK506, azathioprine, CD3 antibodies and 

steroids. Also, polypeptides may be fused to immune modulators, and other cytokines such as 
alpha or beta interferon. 

16 4.6.1 DETERMINING POLYPEPTIDE AND POLYNUCLEOTIDE IDENTITY 

AND SIMILARITY 

Preferred identity and/or similarity are designed to give the largest match between the 
sequences tested. Methods to determine identity and similarity are codified in computer 

20 programs including, but are not limited to, the GCG program package, including GAP 

(Devereux, J., et al., Nucleic Acids Research 12(1):387 (1984); Genetics Computer Group, 
University of Wisconsin, Madison, WI), BLASTP, BLASTN, BLASTX, FASTA (Altschul, S.F. 
et ah, J. Molec. Biol. 215:403-410 (1990), PSI-BLAST (Altschul S.F. et aL, Nucleic Acids Res. 

24 vol. 25, pp. 3389-3402, herein incorporated by reference), eMatrix software (Wu et al., J. Comp. 
BioL, Vol. 6, pp. 219-235 (1999), herein incorporated by reference), eMotif software (Nevill- 
Manning et al, ISMB-97, Vol. 4, pp. 202-209, herein incorporated by reference), pFam software 
(Sonnhammer et aL, Nucleic Acids Res., Vol. 26(1), pp. 320-322 (1998), herein incorporated by 

28 reference) and the Kyte-Doolittle hydrophobocity prediction algorithm (J. Mol Biol, 157, pp. 

105-3 1 (1982), incorporated herein by reference). The BLAST programs are publicly available 
from the National Center for Biotechnology Information (NCBI) and other sources (BLAST 
Manual, Altschul, S., et al. NCB NLM NIH Bethesda, MD 20894; Altschul, S., et al., J. Mol. 

32 Biol. 215:403-410 (1990). 

4.7 CHIMERIC AND FUSION PROTEINS 

The invention also provides chimeric or fusion proteins. As used herein, a "chimeric 

protein" or "fusion protein" comprises a polypeptide of the invention operatively linked to 
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another polypeptide. Within a fusion protein the polypeptide according to the invention can 
correspond to all or a portion of a protein according to the invention. In one embodiment;, a 
fusion protein comprises at least one biologically active portion of a protein according to the 

4 invention. In another embodiment, a fusion protein comprises at least two biologically active 
portions of a protein according to the invention. Within the fusion protein, the term "operatively 
linked" is intended to indicate that the polypeptide according to the invention and the other 
polypeptide are fused in-frame to each other. The polypeptide can be fused to the N-terminus or 

8 C-terminus. 

For example, in one embodiment a fusion protein comprises a polypeptide according to 
the invention operably linked to the extracellular domain of a second protein. 

In another embodiment, the fusion protein is a GST-fusion protein in which the 

12 polypeptide sequences of the invention are fused to the C-terminus of the GST (i.e., glutathione 
S-transferase) sequences. 

In another embodiment, the fusion protein is an immunoglobulin fusion protein in which 
the polypeptide sequences according to the invention comprises one or more domains are fused 

16 to sequences derived from a member of the immunoglobulin protein family. The 

immunoglobulin fusion proteins of the invention can be incorporated into pharmaceutical 
compositions and administered to a subject to inhibit an interaction between a ligand and a 
protein of the invention on the surface of a cell, to thereby suppress signal transduction in vivo. 

20 The immunoglobulin fusion proteins can be used to affect the bioavailability of a cognate ligand. 
Inhibition of the ligand/protein interaction may be useful therapeutically for both the treatment of 
proliferative and differentiative disorders, e.g., cancer as well as modulating (e.g., promoting or 
inhibiting) cell survival. Moreover, the immunoglobulin fusion proteins of the invention can be 

24 used as immunogens to produce antibodies in a subject, to purify ligands, and in screening assays 
to identify molecules that inhibit the interaction of a polypeptide of the invention with a ligand. 

A chimeric or fusion protein of the invention can be produced by standard recombinant 
DNA techniques. For example, DNA fragments coding for the different polypeptide sequences 

28 are ligated together in-frame in accordance with conventional techniques, e.g., by employing 
blunt-ended or stagger-ended termini for ligation, restriction enzyme digestion to provide for 
appropriate termini, filling-in of cohesive ends as appropriate, alkaline phosphatase treatment to 
avoid undesirable joining, and enzymatic ligation. In another embodiment, the fusion gene can 

32 be synthesized by conventional techniques including automated DNA synthesizers. 

Alternatively, PGR amplification of gene fragments can be carried out using anchor primers that 
give rise to complementary overhangs between two consecutive gene fragments that can 
subsequently be annealed and reamplified to generate a chimeric gene sequence (see, for 
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example, Ausubel et al. (eds.) Current Protocols in Molecular Biology, John Wiley & 
Sons, 1992). Moreover, many expression vectors axe commercially available that already encode 
a fusion moiety {e.g., a GST polypeptide). A nucleic acid encoding a polypeptide of the 

ml 

4 invention can be cloned into such an expression vector such that the fusion moiety is linked 
in-frame to the protein of the invention. 

4.8 GENE THERAPY 

8 Mutations in the polynucleotides of the invention gene may result in loss of normal 

function of the encoded protein. The invention thus provides gene therapy to restore normal 
activity of the polypeptides of the invention; or to treat disease states involving polypeptides of 
the invention. Delivery of a functional gene encoding polypeptides of the invention to 

12 appropriate cells is effected ex vivo, in situ, or in vivo by use of vectors, and more particularly 
viral vectors (e.g., adenovirus, adeno-associated virus, or a retrovirus), or ex vivo by use of 
physical DNA transfer methods (e.g., liposomes or chemical treatments). See, for example, 
Anderson, Nature, supplement to vol. 392, no. 6679, pp.25~20 (1998). Fof additional reviews of 

16 gene therapy technology see Friedmann, Science, 244: 1275-1281 (1989); Verma, Scientific 
American: 68-84 (1990); and Miller, Nature, 357: 455-460 (1992). Introduction of any one of 
the nucleotides of the present invention or a gene encoding the polypeptides of the present 
invention can also be accomplished with extrachromosomal substrates (transient expression) or 

20 artificial chromosomes (stable expression). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 
Alternatively, it is contemplated that in other human disease states, preventing the expression of 

24 or inhibiting the activity of polypeptides of the invention will be useful in treating the disease 
states. It is contemplated that antisense therapy or gene therapy could be applied to negatively 
regulate the expression of polypeptides of the invention. 

Other methods inhibiting expression of a protein include the introduction of antisense 

28 molecules to the nucleic acids of the present invention, their complements, or their translatedRNA 
sequences, by methods known in the art. Further, the polypeptides of the present invention can be 
inhibited by using targeted deletion methods, or the insertion of a negative regulatory element such 
as a silencer, which is tissue specific. 

32 The present invention still further provides cells genetically engineered in vivo to express the 

polynucleotides of the invention, wherein such polynucleotides are in operative association with a 
regulatory sequence heterologous to the host cell which drives expression of the polynucleotides in 
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the cell. These methods can be used to increase or decrease the expression of the polynucleotides of 
the present invention. 

Knowledge of DNA sequences provided by the invention allows for modification of cells to 
4 permit, increase, or decrease, expression of endogenous polypeptide. Cells can be modified (e.g. , by 
homologous recombination) to provide increased polypeptide expression by replacing, in whole or 
in part, the naturally occurring promoter with all or part of a heterologous promoter so that the cells 
express the protein at higher levels. The heterologous promoter is inserted in such a manner that it is 
8 operatively linked to the desired protein encoding sequences. See, for example, PCT International 
Publication No. WO 94/1 2650, PCT International PublicationNo. WO 92/20808, and PCT 
InternationalPublicationNo. WO 91/09955. It is also contemplated that, in addition to heterologous 
promoter DNA, amplifiable marker DNA (e.g. , ada, dhfr, and the multifunctional CAD gene which 

12 encodes carbamyl phosphate synthase, aspartate transcarbamylase, and dihydroorotase) and/or 

intron DNA may be inserted along with the heterologous promoter DNA. If linked to the desired 
protein coding sequence, amplification of the marker DNA by standard selection methods results in 
co-amplification of the desired protein coding sequences in the cells. 

16 In another embodiment of the present invention, cells and tissues may be engineered to 

express an endogenous gene comprising the polynucleotides of the invention under the control of 
inducible regulatory elements, in which case the regulatory sequences of the endogenous gene may 
be replaced by homologous recombination. As described herein, gene targeting can be used to 

20 replace a gene's existing regulatory region with a regulatory sequence isolated from a different gene 
or a novel regulatory sequence synthesized by genetic engineering methods. Such regulatory 
sequences may be comprised of promoters, enhancers, scaffold-attachment regions, negative 
regulatory elements, transcriptional initiation sites, regulatory protein binding sites or combinations 

24 of said sequences. Alternatively, sequences which affect the structure or stability of the RNA or 
protein produced may be replaced, removed, added, or otherwise modified by targeting. These 
sequences include polyadenylation signals, mRNA stability elements, splice sites, leader sequences 
for enhancing or modifying transport or secretion properties of the protein, or other sequences 

28 which alter or improve the function or stability of protein or RNA molecules. 

The targeting event may be a simple insertion of the regulatory sequence, placing the gene 
under the control of the new regulatory sequence, e.g., inserting a new promoter or enhancer or both 
upstream of a gene. Alternatively, the targeting event may be a simple deletion of a regulatory 

32 element, such as the deletion of a tissue-specific negative regulatory element. Alternatively, the 
targeting event may replace an existing element; for example, a tissue-specific enhancer can be 
replaced by an enhancer that has broader or different cell-type specificity than the naturally 
occurring elements. Here, the naturally occurring sequences are deleted and new sequences are 
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added. In all cases, the identification of the targeting event may be facilitated by the use of one or 
more selectable marker genes that are contiguous with the targeting DNA, allowing for the selection 
of cells in which the exogenous DNA has integrated into the cell genome. The identification of the 
4 targeting event may also be facilitated by the use of one or more marker genes exhibiting the 

property of negative selection, such that the negatively selectable marker is linked to the exogenous 
DNA, but configured such that the negatively selectable marker flanks the targeting sequence, and 
such that a correct homologous recombination event with sequences in the host cell genome does 
8 not result in the stable integration of the negatively selectable marker. Markers useful for this 
purpose include the Herpes Simplex Virus thymidine kinase (TK) gene or the bacterial 
xanthine-guanine phosphoribosyl-transferase (gpt) gene. 

The gene targeting or gene activation techniques which can be used in accordance with this 
1 2 aspect of the invention are more particularly described in U.S. Patent No. 5,272,07 1 to Chappel; 
U.S. Patent No. 5,578,461 to Sherwin et at.; International ApplicationNo. PCT/US92/09627 
(WO93/09222)by Seldenet aL; and International ApplicationNo. PCT/US90/06436 
(W09 1/06667) by Skoultchi et aL, each of which is incorporated by reference herein in its entirety. 

16 

4.9 TRANSGENIC ANIMALS 

In preferred methods to determine biological functions of the polypeptides of the 
invention in vivo, one or more genes provided by the invention are either over expressed or 

20 inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244: 1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 
control of exogenous or endogenous promoter elements, are known as transgenic animals. 
Animals in which an endogenous gene has been inactivated by homologous recombination are 

24 referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be 
prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic 
animals are useful to determine the roles polypeptides of the invention play in biological 
processes, and preferably in disease states. Transgenic animals are useful as model systems to 

28 identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 
Publication No. W094/28122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of a promoter of the 

32 polynucleotides of the invention is either activated or inactivated to alter the level of expression 
of the polypeptides of the invention. Inactivation can be carried out using homologous 
recombination methods described above. Activation can be achieved by supplementing or even 
replacing the homologous promoter to provide for increased protein expression. The homologous 
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promoter can be supplemented by insertion of one or more heterologous enhancer elements 
known to confer promoter activation in a particular tissue. 

The polynucleotides of the present invention also make possible the development, 
4 through, e.g., homologous recombination or knock out strategies, of animals that fail to express 
polypeptides of the invention or that express a variant polypeptide. Such animals are useful as 
models for studying the in vivo activities of polypeptide as well as for studying modulators of the 
polypeptides of the invention. 
8 In preferred methods to determine biological functions of the polypeptides of the 

invention in vivo, one or more genes provided by the invention are either over expressed or 
inactivated in the germ line of animals using homologous recombination [Capecchi, Science 
244:1288-1292 (1989)]. Animals in which the gene is over expressed, under the regulatory 

12 control of exogenous or endogenous promoter elements, are known as transgenic animals. 

Animals in which an endogenous gene has been inactivated by homologous recombination are 
referred to as "knockout" animals. Knockout animals, preferably non-human mammals, can be 
prepared as described in U.S. Patent No. 5,557,032, incorporated herein by reference. Transgenic 

1 6 animals are useful to determine the roles polypeptides of the invention play in biological 

processes, and preferably in disease states. Transgenic animals are useful as model systems to 
identify compounds that modulate lipid metabolism. Transgenic animals, preferably non-human 
mammals, are produced using methods as described in U.S. Patent No 5,489,743 and PCT 

20 Publication No. W094/28 122, incorporated herein by reference. 

Transgenic animals can be prepared wherein all or part of the polynucleotides of the 
invention promoter is either activated or inactivated to alter the level of expression of the 
polypeptides of the invention. Inactivation can be carried out using homologous recombination 

24 methods described above. Activation can be achieved by supplementing or even replacing the 
homologous promoter to provide for increased protein expression. The homologous promoter 
can be supplemented by insertion of one or more heterologous enhancer elements known to 
confer promoter activation in a particular tissue. 

28 

4.10 USES AND BIOLOGICAL ACTIVITY 

The polynucleotides and proteins of the present invention are expected to exhibit one or 
more of the uses or biological activities (including those associated with assays cited herein) 
32 identified herein. Uses or activities described for proteins of the present invention may be 

provided by administration or use of such proteins or of polynucleotides encoding such proteins 
(such as, for example, in gene therapies or vectors suitable for introduction of DNA). The 
mechanism underlying the particular condition or pathology will dictate whether the 
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polypeptides of the invention, the polynucleotides of the invention or modulators (activators or 
inhibitors) thereof would be beneficial to the subject in need of treatment. Thus, "therapeutic 
compositions of the invention" include compositions comprising isolated polynucleotides 
4 (including recombinant DNA molecules, cloned genes and degenerate variants thereof) or 
polypeptides of the invention (including full length protein, mature protein and truncations or 
domains thereof), or compounds and other substances that modulate the overall activity of the 
target gene products, either at the level of target gene/protein expression or target protein 
8 activity. Such modulators include polypeptides, analogs, (variants), including fragments and 
fusion proteins, antibodies and other binding proteins; chemical compounds that directly or 
indirectly activate or inhibit the polypeptides of the invention (identified, e.g., via drug screening 
assays as described herein); antisense polynucleotides and polynucleotides suitable for triple 
12 helix formation; and in particular antibodies or other binding partners that specifically recognize 
one or more epitopes of the polypeptides of the invention. 

The polypeptides of the present invention may likewise be involved in cellular activation 
or in one of the other physiological pathways described herein. 

16 

4.10.1 RESEARCH USES AND UTILITIES 

The polynucleotides provided by the present invention can be used by the research 
community for various purposes. The polynucleotides can be used to express recombinant 

20 protein for analysis, characterization or therapeutic use; as markers for tissues in which the 

corresponding protein is preferentially expressed (either constitutively or at a particular stage of 
tissue differentiation or development or in disease states); as molecular weight markers on gels; 
as chromosome markers or tags (when labeled) to identify chromosomes or to map related gene 

24 positions; to compare with endogenous DNA sequences in patients to identify potential genetic 
disorders; as probes to hybridize and thus discover novel, related DNA sequences; as a source of 
information to derive PCR primers for genetic fingerprinting; as a probe to "subtract-out" known 
sequences in the process of discovering other novel polynucleotides; for selecting and making 

28 oligomers for attachment to a "gene chip" or other support, including for examination of 

expression patterns; to raise anti-protein antibodies using DNA immunization techniques; and as 
an antigen to raise anti-DNA antibodies or elicit another immune response. Where the 
polynucleotide encodes a protein which binds or potentially binds to another protein (such as, for 

32 example, in a receptor-ligand interaction), the polynucleotide can also be used in interaction trap 
assays (such as, for example, that described in Gyuris et al. ? Cell 75:791-803 (1993)) to identify 
polynucleotides encoding the other protein with which binding occurs or to identify inhibitors of 
the binding interaction. 



38 



WO 01/64835 



PCT/US01/04927 



The polypeptides provided by the present invention can similarly be used in assays to 
determine biological activity, including in a panel of multiple proteins for high-throughput 
screening; to raise antibodies or to elicit another immune response; as a reagent (including the 
4 labeled reagent) in assays designed to quantitatively determine levels of the protein (or its 

receptor) in biological fluids; as markers for tissues in which the corresponding polypeptide is 
preferentially expressed (either constitutively or at a particular stage of tissue differentiation or 
development or in a disease state); and, of course, to isolate correlative receptors or ligands. 
8 Proteins involved in these binding interactions can also be used to screen for peptide or small 
molecule inhibitors or agonists of the binding interaction. 

Any or all of these research utilities are capable of being developed into reagent grade or 
kit format for commercialization as research products. 
12 Methods for performing the uses listed above are well known to those skilled in the art. 

References disclosing such methods include without limitation "Molecular Cloning: A 
Laboratory Manual", 2d ed., Cold Spring Harbor Laboratory Press, Sambrook, J., E. F. Fritsch 
and T. Maniatis eds., 1989, and "Methods in Enzymology: Guide to Molecular Cloning 
16 Techniques", Academic Press, Berger, S. L. and A. R. Kimmel eds., 1987. 

4.10.2 NUTRITIONAL USES 

Polynucleotides and polypeptides of the present invention can also be used as nutritional 
20 sources or supplements. Such uses include without limitation use as a protein or amino acid 

supplement, use as a carbon source, use as a nitrogen source and use as a source of carbohydrate. In 
such cases the polypeptide or polynucleotide of the invention can be added to the feed of a 
particular organism or can be administered as a separate solid or liquid preparation, such as in the 
24 form of powder, pills, solutions, suspensions or capsules. In the case of microorganisms, the 
polypeptide or polynucleotide of the invention can be added to the medium in or on which the 
microorganism is cultured. 

28 4.103 CYTOKINE AND CELL PROLIFERATION/DIFFERENTIATION 

ACTIVITY 

A polypeptide of the present invention may exhibit activity relating to cytokine, cell 
proliferation (either inducing or inhibiting) or cell differentiation (either inducing or inhibiting) 
32 activity or may induce production of other cytokines in certain cell populations. A 

polynucleotide of the invention can encode a polypeptide exhibiting such attributes. Many 
protein factors discovered to date, including all known cytokines, have exhibited activity in one 
or more factor-dependent cell proliferation assays, and hence the assays serve as a convenient 
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confirmation of cytokine activity. The activity of therapeutic compositions of the present 
invention is evidenced by any one of a number of routine factor dependent cell proliferation 
assays for cell lines including, without limitation, 32D, DA2, DA1G, T10, B9, B9/1 1, BaF3, 
4 MC9/G, M+(preB M+), 2ES, RB5, DAI, 123, T1165, HT2, CTLL2 5 TF-1, Mo7e, CMK, 

HUVEC, and Caco. Therapeutic compositions of the invention can be used in the following: 

Assays for T-cell or thymocyte proliferation include without limitation those described 
in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 
8 M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 
Humans); Takai et al., J. Immunol. 137:3494-3500, 1986; Bertagnolli et al., J. Immunol. 
145:1706-1712, 1990; Bertagnolli et al., Cellular Immunology 133:327-341, 1991; Bertagnolli, 

12 et al., I. Immunol. 149:3778-3783, 1992; Bowman et al., I. Immunol. 152:1756-1761, 1994. 

Assays for cytokine production and/or proliferation of spleen cells, lymph node cells or 
thymocytes include, without limitation, those described in: Polyclonal T cell stimulation, 
Kruisbeek, A. M. and Shevach, E. M. In Current Protocols in Immunology. J. E. e.a. Coligan 

16 eds. Vol 1 pp. 3.12.1-3.12.14, John Wiley and Sons, Toronto. 1994; and Measurement of mouse 
and human interleukin-y, Schreiber, R. D. In Current Protocols in Immunology. J. E. e.a. Coligan 
eds. Vol 1 pp. 6.8.1-6.8.8, John Wiley and Sons, Toronto. 1994. 

Assays for proliferation and differentiation of hematopoietic and lymphopoietic cells 

20 include, without limitation, those described in: Measurement of Human and Murine Interleukin 2 
and Interleukin 4, Bottomly, K., Davis, L. S. and Lipsky, P. E. In Current Protocols in 
Immunology. J. E. e.a. Coligan eds. Vol 1 pp. 6.3.1-6.3.12, John Wiley and Sons, Toronto. 1991; 
deVries et al., J. Exp. Med. 173:1205-1211, 1991; Moreau et al., Nature 336:690-692, 1988; 

24 Greenberger et al., Proc. Natl. Acad. Sci. U.S.A. 80:2931-2938, 1983; Measurement of mouse 

and human interleukin 6— Nordan, R, In Current Protocols in Immunology. J. E. Coligan eds. Vol 
1 pp. 6.6.1-6.6.5, John Wiley and Sons, Toronto. 1991; Smith et al., Proc. Natl. Aced. Sci. 
U.S.A. 83:1857-1861, 1986; Measurement of human Interleukin 11 -Bennett, F., Giannotti, J., 

28 Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. J. E. Coligan eds. Vol 1 pp. 
6.15.1 John Wiley and Sons, Toronto. 1991; Measurement of mouse and human Interleukin 
9— Ciarletta, A., Giannotti, J., Clark, S. C. and Turner, K. J. In Current Protocols in Immunology. 
J. E. Coligan eds. Vol 1 pp. 6.13.1, John Wiley and Sons, Toronto. 1991. 

32 Assays for T-cell clone responses to antigens (which will identify, among others, proteins 

that affect APC-T cell interactions as well as direct T-cell effects by measuring proliferation and 
cytokine production) include, without limitation, those described in: Current Protocols in 
Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. Shevach, W Strober, 



40 



WO 01/64835 



PCT/US01/04927 



Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, In Vitro assays for Mouse 
Lymphocyte Function; Chapter 6, Cytokines and their cellular receptors; Chapter 7, 
Immunologic studies in Humans); Weinberger et al., Proc. Natl. Acad. Sci. USA 77:6091-6095, 
4 1980; Weinberger et aL, Eur. J. Immun. 1 1 :405-41 1, 1981; Takai et al., J. Immunol. 
137:3494-3500, 1986; Takai et al., J. Immunol. 140:508-512, 1988. 

4.10.4 STEM CELL GROWTH FACTOR ACTIVITY 

8 A polypeptide of the present invention may exhibit stem cell growth factor activity and 

be involved in the proliferation, differentiation and survival of pluripotent and totipotent stem 
cells including primordial germ cells, embryonic stem cells, hematopoietic stem cells and/or 
germ line stem cells. Administration of the polypeptide of the invention to stem cells in vivo or 

12 ex vivo is expected to maintain and expand cell populations in a totipotential or pluripotential 
state which would be useful for re-engineering damaged or diseased tissues, transplantation, 
manufacture of bio-pharmaceuticals and the development of bio-sensors. The ability to produce 
large quantities of human cells has important working applications for the production of human 

1 6 proteins which currently must be obtained from non-human sources or donors, implantation of 
cells to treat diseases such as Parkinson's, Alzheimer's and other neurodegenerative diseases; 
tissues for grafting such as bone marrow, skin, cartilage, tendons, bone, muscle (including 
cardiac muscle), blood vessels, cornea, neural cells, gastrointestinal cells and others; and organs 

20 for transplantation such as kidney, liver, pancreas (including islet cells), heart and lung. 

It is contemplated that multiple different exogenous growth factors and/or cytokines may 
be administered in combination with the polypeptide of the invention to achieve the desired 
effect, including any of the growth factors listed herein, other stem cell maintenance factors, and 

24 specifically including stem cell factor (SCF), leukemia inhibitory factor (LIF), Flt-3 ligand (Flt- 
3L), any of the interleukins, recombinant soluble IL-6 receptor fused to IL-6, macrophage 
inflammatory protein 1 -alpha (MIP-1 -alpha), G-CSF, GM-CSF, thrombopoietin (TPO), platelet 
factor 4 (PF-4), platelet-derived growth factor (PDGF), neural growth factors and basic 

28 fibroblast growth factor (bFGF). 

Since totipotent stem cells can give rise to virtually any mature cell type, expansion of 
these cells in culture will facilitate the production of large quantities of mature cells. Techniques 
for culturing stem cells are known in the art and administration of polypeptides of the invention, 

32 optionally with other growth factors and/or cytokines, is expected to enhance the survival and 
proliferation of the stem cell populations. This can be accomplished by direct administration of 
the polypeptide of the invention to the culture medium. Alternatively, stroma cells transfected 
with a polynucleotide that encodes for the polypeptide of the invention can be used as a feeder 



41 



WO 01/64835 



PCT7US0 1/04927 



layer for the stem cell populations in culture or in vivo. Stromal support cells for feeder layers 
may include embryonic bone marrow fibroblasts, bone marrow stromal cells, fetal liver cells, or 
cultured embryonic fibroblasts (see U.S. Patent No. 5,690,926). 
4 Stem cells themselves can be transfected with a polynucleotide of the invention to induce 

autocrine expression of the polypeptide of the invention. This will allow for generation of 
undifferentiated totipotential/pluripotential stem cell lines that are useful as is or that can then be 
differentiated into the desired mature cell types. These stable cell lines can also serve as a source 
8 of undifferentiated totipotential/pluripotential mRNA to create cDNA libraries and templates for 
polymerase chain reaction experiments. These studies would allow for the isolation and 
identification of differentially expressed genes in stem cell populations that regulate stem cell 
proliferation and/or maintenance. 

12 Expansion and maintenance of totipotent stem cell populations will be useful in the 

treatment of many pathological conditions. For example, polypeptides of the present invention 
may be used to manipulate stem cells in culture to give rise to neuroepithelial cells that can be 
used to augment or replace cells damaged by illness, autoimmune disease, accidental damage or 

16 genetic disorders. The polypeptide of the invention may be useful for inducing the proliferation 
of neural cells and for the regeneration of nerve and brain tissue, le. for the treatment of central 
and peripheral nervous system diseases and neuropathies, as well as mechanical and traumatic 
disorders which involve degeneration, death or trauma to neural cells or nerve tissue. In addition, 

20 the expanded stem cell populations can also be genetically altered for gene therapy purposes and 
to decrease host rejection of replacement tissues after grafting or implantation. 

Expression of the polypeptide of the invention and its effect on stem cells can also be 
manipulated to achieve controlled differentiation of the stem cells into more differentiated cell 

24 types. A broadly applicable method of obtaining pure populations of a specific differentiated 
cell type from undifferentiated stem cell populations involves the use of a cell-type specific 
promoter driving a selectable marker. The selectable marker allows only cells of the desired type 
to survive. For example, stem cells can be induced to differentiate into cardiomyocytes (Wobus 

28 et ah, Differentiation, 48: 173-182, (1991); Klug et ah, J. Clin. Invest, 98(1): 216-224, (1998)) 
or skeletal muscle cells (Browder, L. W. In: Principles of Tissue Engineering eds. Lanza et al., 
Academic Press (1997)). Alternatively, directed differentiation of stem cells can be 
accomplished by culturing the stem cells in the presence of a differentiation factor such as 

32 retinoic acid and an antagonist of the polypeptide of the invention which would inhibit the 
effects of endogenous stem cell factor activity and allow differentiation to proceed. 

In vitro cultures of stem cells can be used to determine if the polypeptide of the invention 
exhibits stem cell growth factor activity. Stem cells are isolated from any one of various cell 
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sources (including hematopoietic stem cells and embryonic stem cells) and cultured on a feeder 
layer, as described by Thompson et al. Proc. Natl. Acad. Sci, U.S.A., 92: 7844-7848 (1995), in 
the presence of the polypeptide of the invention alone or in combination with other growth 
4 factors or cytokines. The ability of the polypeptide of the invention to induce stem cells 
proliferation is determined by colony formation on semi-solid support e.g. as described by 
Bernstein et al., Blood, 77: 23 16-2321 (1991). 

8 4.10.5 HEMATOPOIESIS REGULATING ACTIVITY 

A polypeptide of the present invention may be involved in regulation of hematopoiesis 
and, consequently, in the treatment of myeloid or lymphoid cell disorders. Even marginal 
biological activity in support of colony forming cells or of factor-dependent cell lines indicates 

12 involvement in regulating hematopoiesis, e.g. in supporting the growth and proliferation of 
erythroid progenitor cells alone or in combination with other cytokines, thereby indicating 
utility, for example, in treating various anemias or for use in conjunction with 
irradiation/chemotherapy to stimulate the production of erythroid precursors and/or erythroid 

16 cells; in supporting the growth and proliferation of myeloid cells such as granulocytes and 

monocytes/macrophages (f.e., traditional CSF activity) useful, for example, in conjunction with 
chemotherapy to prevent or treat consequent myelo-suppression; in supporting the growth and 
proliferation of megakaryocytes and consequently of platelets thereby allowing prevention or 

20 treatment of various platelet disorders such as thrombocytopenia, and generally for use in place 
of or complimentary to platelet transfusions; and/or in supporting the growth and proliferation of 
hematopoietic stem cells which are capable of maturing to any and all of the above-mentioned 
hematopoietic cells and therefore find therapeutic utility in various stem cell disorders (such as 

24 those usually treated with transplantation, including, without limitation, aplastic anemia and 
paroxysmal nocturnal hemoglobinuria), as well as in repopulating the stem cell compartment 
post irradiation/chemotherapy, either in-vivo or ex-vivo {i.e., in conjunction with bone marrow 
transplantation or with peripheral progenitor cell transplantation (homologous or heterologous)) 

28 as normal cells or genetically manipulated for gene therapy. 

Therapeutic compositions of the invention can be used in the following: 
Suitable assays for proliferation and differentiation of various hematopoietic lines are 
cited above. 

32 Assays for embryonic stem cell differentiation (which will identify, among others, 

proteins that influence embryonic differentiation hematopoiesis) include, without limitation, 
those described in: Johansson et al. Cellular Biology 15:141-151, 1995; Keller et al., Molecular 
and Cellular Biology 13:473-486, 1993; McClanahanet al., Blood 81:2903-2915, 1993. 
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Assays for stem cell survival and differentiation (which will identify, among others, 
proteins that regulate lympho-hematopoiesis) include, without limitation, those described in: 
Methylcellulose colony forming assays, Freshney, M. G. In Culture of Hematopoietic Cells. R. I. 

4 Freshney, et aL eds. Vol pp. 265-268, Wiley-Liss, Inc., New York, N.Y. 1994; Hirayama et al., 
Proc. Natl. Acad. Sci. USA 89:5907-591 1, 1992; Primitive hematopoietic colony forming cells 
with high proliferative potential, McNiece, I. K. and Briddell, R. A. In Culture of Hematopoietic 
Cells. R. I. Freshney, et al. eds. Vol pp. 23-39, Wiley-Liss, Inc., New York, N.Y. 1994; Neben et 

8 al., Experimental Hematology 22:353-359, 1994; Cobblestone area forming cell assay, 

Ploemacher, R. E. In Culture of Hematopoietic Cells. R. I. Freshney, et al. eds. Vol pp. 1-21, 
Wiley-Liss, Inc., New York, N.Y. 1994; Long term bone marrow cultures in the presence of 
stromal cells, Spooncer, E., Dexter, M. and Allen, T. In Culture of Hematopoietic Cells. R. I. 
12 Freshney, et al. eds. Vol pp. 163-179, Wiley-Liss, Inc., New York, N.Y. 1994; Long term culture 
initiating cell assay, Sutherland, H. J. In Culture of Hematopoietic Cells. R. I. Freshney, et al. 
eds. Vol pp. 139-162, Wiley-Liss, Inc., New York, N.Y. 1994. 

16 4.10,6 TISSUE GROWTH ACTIVITY 

A polypeptide of the present invention also may be involved in bone, cartilage, tendon, 
ligament and/or nerve tissue growth or regeneration, as well as in wound healing and tissue 
repair and replacement, and in healing of burns, incisions and ulcers. 

20 A polypeptide of the present invention which induces cartilage and/or bone growth in 

circumstances where bone is not normally formed, has application in the healing of bone 
fractures and cartilage damage or defects in humans and other animals. Compositions of a 
polypeptide, antibody, binding partner, or other modulator of the invention may have 

24 prophylactic use in closed as well as open fracture reduction and also in the improved fixation of 
artificial joints. De novo bone formation induced by an osteogenic agent contributes to the 
repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and 
also is useful in cosmetic plastic surgery. 

28 A polypeptide of this invention may also be involved in attracting bone-forming cells, 

stimulating growth of bone-forming cells, or inducing differentiation of progenitors of 
bone-forming cells. Treatment of osteoporosis, osteoarthritis, bone degenerative disorders, or 
periodontal disease, such as through stimulation of bone and/or cartilage repair or by blocking 

32 inflammation or processes of tissue destruction (collagenase activity, osteoclast activity, etc.) 
mediated by inflammatory processes may also be possible using the composition of the 
invention. 
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Another category of tissue regeneration activity that may involve the polypeptide of the 
present invention is tendon/ligament formation. Induction of tendon/ligament-like tissue or 
other tissue formation in circumstances where such tissue is not normally formed, has 

4 application in the healing of tendon or ligament tears, deformities and other tendon or ligament 
defects in humans and other animals. Such a preparation employing a tendon/ligament-like 
tissue inducing protein may have prophylactic use in preventing damage to tendon or ligament 
tissue, as well as use in the improved fixation of tendon or ligament to bone or other tissues, and 

8 in repairing defects to tendon or ligament tissue. De novo tendon/ligament-like tissue formation 
induced by a composition of the present invention contributes to the repair of congenital, trauma 
induced, or other tendon or ligament defects of other origin, and is also useful in cosmetic plastic 
surgery for attachment or repair of tendons or ligaments. The compositions of the present 
12 invention may provide environment to attract tendon- or ligament-forming cells, stimulate 

growth of tendon- or ligament-forming cells, induce differentiation of progenitors of tendon- or 
ligament-fomiing cells, or induce growth of tendon/ligament cells or progenitors ex vivo for 
return in vivo to effect tissue repair. The compositions of the invention may also be useful in the 
16 treatment of tendinitis, carpal tunnel syndrome and other tendon or ligament defects. The 

compositions may also include an appropriate matrix and/or sequestering agent as a carrier as is 
well known in the art. 

The compositions of the present invention may also be useful for proliferation of neural 

20 cells and for regeneration of nerve and brain tissue, i.e. for the treatment of central and peripheral 
nervous system diseases and neuropathies, as well as mechanical and traumatic disorders, which 
involve degeneration, death or trauma to neural cells or nerve tissue. More specifically, a 
composition may be used in the treatment of diseases of the peripheral nervous system, such as 

24 peripheral nerve injuries, peripheral neuropathy and localized neuropathies, and central nervous 
system diseases, such as Alzheimer's, Parkinson's disease, Huntington's disease, amyotrophic 
lateral sclerosis, and Shy-Drager syndrome. Further conditions which may be treated in 
accordance with the present invention include mechanical and traumatic disorders, such as spinal 

28 cord disorders, head trauma and cerebrovascular diseases such as stroke. Peripheral neuropathies 
resulting from chemotherapy or other medical therapies may also be treatable using a 
composition of the invention. 

Compositions of the invention may also be useful to promote better or faster closure of 

32 non-healing wounds, including without limitation pressure ulcers, ulcers associated with vascular 
insufficiency, surgical and traumatic wounds, and the like. 

Compositions of the present invention may also be involved in the generation or 
regeneration of other tissues, such as organs (including, for example, pancreas, liver, intestine, 
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kidney, skin, endothelium), muscle (smooth, skeletal or cardiac) and vascular (including vascular 
endothelium) tissue, or for promoting the growth of cells comprising such tissues. Part of the 
desired effects may be by inhibition or modulation of fibrotic scarring may allow normal tissue 
4 to regenerate. A polypeptide of the present invention may also exhibit angiogenic activity. 
A composition of the present invention may also be useful for gut protection or 
regeneration and treatment of lung or liver fibrosis, reperfusion injury in various tissues, and 
conditions resulting from systemic cytokine damage. 
8 A composition of the present invention may also be useful for promoting or inhibiting 

differentiation of tissues described above from precursor tissues or cells; or for inhibiting the 
growth of tissues described above. 

Therapeutic compositions of the invention can be used in the following: 
12 Assays for tissue generation activity include, without limitation, those described in: 

International Patent Publication No. WO95/16035 (bone, cartilage, tendon); International Patent 
Publication No. WO95/05846 (nerve, neuronal); International Patent Publication No. 
WO91/07491 (skin, endothelium). 
16 Assays for wound healing activity include, without limitation, those described in: Winter, 

Epidermal Wound Healing, pps. 71-112 (Maibach, H. I. and Rovee, D. T., eds.), Year Book 
Medical Publishers, Inc., Chicago, as modified by Eaglstein and Mertz, J. Invest. Dermatol 
71:382-84(1978). 

20 

4.10.7 IMMUNE STIMULATING OR SUPPRESSING ACTIVITY 

A polypeptide of the present invention may also exhibit immune stimulating or immune 
suppressing activity, including without limitation the activities for which assays are described 

24 herein. A polynucleotide of the invention can encode a polypeptide exhibiting such activities. A 
protein may be useful in the treatment of various immune deficiencies and disorders (including 
severe combined immunodeficiency (SCID)), e.g., in regulating (up or down) growth and 
proliferation of T and/or B lymphocytes, as well as effecting the cytolytic activity of NK cells 

28 and other cell populations. These immune deficiencies may be genetic or be caused by viral (e.g., 
HIV) as well as bacterial or fungal infections, or may result from autoimmune disorders. More 
specifically, infectious diseases causes by viral, bacterial, fungal or other infection may be 
treatable using a protein of the present invention, including infections by HIV, hepatitis viruses, 

32 herpes viruses, mycobacteria, Leishmania spp., malaria spp. and various fungal infections such 
as candidiasis. Of course, in this regard, proteins of the present invention may also be useful 
where a boost to the immune system generally may be desirable, z.e., in the treatment of cancer. 
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Autoimmune disorders which may be treated using a protein of the present invention 
include, for example, connective tissue disease, multiple sclerosis, systemic lupus erythematosus, 
rheumatoid arthritis,, autoimmune pulmonary inflammation, Guillain-Barre syndrome, 
4 autoimmune thyroiditis, insulin dependent diabetes mellitis, myasthenia gravis, graft-versus-host 
disease and autoimmune inflammatory eye disease. Such a protein (or antagonists thereof, 
including antibodies) of the present invention may also to be useful in the treatment of allergic 
reactions and conditions (e.g., anaphylaxis, serum sickness, drug reactions, food allergies, insect 
8 venom allergies, mastocytosis, allergic rhinitis, hypersensitivity pneumonitis, urticaria, 
angioedema, eczema, atopic dermatitis, allergic contact dermatitis, erythema multiforme, 
Stevens-Johnson syndrome, allergic conjunctivitis, atopic keratoconjunctivitis, venereal 
keratoconjunctivitis, giant papillary conjunctivitis and contact allergies), such as asthma 
. 12 (particularly allergic asthma) or other respiratory problems. Other conditions, in which immune 
suppression is desired (including, for example, organ transplantation), may also be treatable 
using a protein (or antagonists thereof) of the present invention. The therapeutic effects of the 
polypeptides or antagonists thereof on allergic reactions can be evaluated by in vivo animals 

16 models such as the cumulative contact enhancement test (Lastbom et al., Toxicology 125: 59-66, 
1998), skin prick test (Hoffmann et aL, Allergy 54: 446-54, 1999), guinea pig skin sensitization 
test (Vohr et al., Arch. Toxocol. 73: 501-9), and murine local lymph node assay (Kimber et ah, 
J. Toxicol. Environ. Health 53: 563-79). 

20 Using the proteins of the invention it may also be possible to modulate immune 

responses, in a number of ways. Down regulation may be in the form of inhibiting or blocking an 
immune response already in progress or may involve preventing the induction of an immune 
response. The functions of activated T cells may be inhibited by suppressing T cell responses or 

24 by inducing specific tolerance in T cells, or both. Immunosuppression of T cell responses is 

generally an active, non-antigen-specific, process which requires continuous exposure of the T 
cells to the suppressive agent. Tolerance, which involves inducing non-responsiveness or anergy 
in T cells, is distinguishable from immunosuppression in that it is generally antigen-specific and 

28 persists after exposure to the tolerizing agent has ceased. Operationally, tolerance can be 

demonstrated by the lack of a T cell response upon reexposure to specific antigen in the absence 
of the tolerizing agent. 

Down regulating or preventing one or more antigen functions (including without 

32 limitation B lymphocyte antigen functions (such as, for example, B7)), e.g., preventing high 
level lymphokine synthesis by activated T cells, will be useful in situations of tissue, skin and 
organ transplantation and in graft-versus-host disease (GVHD). For example, blockage of T cell 
function should result in reduced tissue destruction in tissue transplantation. Typically, in tissue 
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transplants, rejection of the transplant is initiated through its recognition as foreign by T cells, 
followed by an immune reaction that destroys the transplant. The administration of a therapeutic 
composition of the invention may prevent cytokine synthesis by immune cells, such as T cells, 
4 and thus acts as an immunosuppressant. Moreover, a lack of costimulation may also be sufficient 
to anergize the T cells, thereby inducing tolerance in a subject. Induction of long-term tolerance 
by B lymphocyte antigen-blocking reagents may avoid the necessity of repeated administration 
of these blocking reagents. To achieve sufficient immunosuppression or tolerance in a subject, it 
8 may also be necessary to block the function of a combination of B lymphocyte antigens. 

The efficacy of particular therapeutic compositions in preventing organ transplant 
rejection or GVHD can be assessed using animal models that are predictive of efficacy in 
humans. Examples of appropriate systems which can be used include allogeneic cardiac grafts in 

12 rats and xenogeneic pancreatic islet cell grafts in mice, both of which have been used to examine 
the immunosuppressive effects of CTLA4Ig fusion proteins in vivo as described in Lenschow et 
aL, Science 257:789-792 (1992) and Turka et al., Proc. Natl. Acad. Sci USA, 89:1 1 102-1 1 105 
(1992). In addition, murine models of GVHD (see Paul ed. ? Fundamental Immunology, Raven 

16 Press, New York, 1989, pp. 846-847) can be used to determine the effect of therapeutic 
compositions of the invention on the development of that disease. 

Blocking antigen function may also be therapeutically useful for treating autoimmune 
diseases. Many autoimmune disorders are the result of inappropriate activation of T cells that are 

20 reactive against self tissue and which promote the production of cytokines and autoantibodies 
involved in the pathology of the diseases. Preventing the activation of autoreactive T cells may 
reduce or eliminate disease symptoms. Administration of reagents which block stimulation of T 
cells can be used to inhibit T cell activation and prevent production of autoantibodies or T 

24 cell-derived cytokines which may be involved in the disease process. Additionally, blocking 
reagents may induce antigen-specific tolerance of autoreactive T cells which could lead to 
long-term relief from the disease. The efficacy of blocking reagents in preventing or alleviating 
autoimmune disorders can be determined using a number of well-characterized animal models of 

28 human autoimmune diseases. Examples include murine experimental autoimmune encephalitis, 
systemic lupus erythmatosis in MRL/lpr/lpr mice orNZB hybrid mice, murine autoimmune 
collagen arthritis, diabetes mellitus in NOD mice and BB rats, and murine experimental 
myasthenia gravis (see Paul ed., Fundamental Immunology, Raven Press, New York, 1989, pp. 

32 840-856). 

Upregulation of an antigen function (e.g., a B lymphocyte antigen function), as a means 
of up regulating immune responses, may also be useful in therapy. Upregulation of immune 
responses may be in the form of enhancing an existing immune response or eliciting an initial 
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immune response. For example, enhancing an immune response may be useful in cases of viral 
infection, including systemic viral diseases such as influenza, the common cold, and 
encephalitis. 

4 Alternatively, anti-viral immune responses may be enhanced in an infected patient by 

removing T cells from the patient, costimulating the T cells in vitro with viral antigen-pulsed 
APCs either expressing a peptide of the present invention or together with a stimulatory form of 
a soluble peptide of the present invention and reintroducing the in vitro activated T cells into the 
8 patient. Another method of enhancing anti-viral immune responses would be to isolate infected 
cells from a patient, transfect them with a nucleic acid encoding a protein of the present 
invention as described herein such that the cells express all or a portion of the protein on their 
surface, and reintroduce the transfected cells into the patient. The infected cells would now be 

12 capable of delivering a costimulatory signal to, and thereby activate, T cells in vivo. 

A polypeptide of the present invention may provide the necessary stimulation signal to T 
cells to induce a T cell mediated immune response against the transfected tumor cells. In 
addition, tumor cells which lack MHC class I or MHC class II molecules, or which fail to 

16 reexpress sufficient mounts of MHC class I or MHC class II molecules, can be transfected with 
nucleic acid encoding all or a portion of (e.g., a cytoplasmic-domain truncated portion) of an 
MHC class I alpha chain protein and p2 microglobulin protein or an MHC class II alpha chain 
protein and an MHC class II beta chain protein to thereby express MHC class I or MHC class II 

20 proteins on the cell surface. Expression of the appropriate class I or class II MHC in conjunction 
with a peptide having the activity of a B lymphocyte antigen (e.g., B7-1, B7-2, B7-3) induces a T 
cell mediated immune response against the transfected tumor cell. Optionally, a gene encoding 
an antisense construct which blocks expression of an MHC class II associated protein, such as 

24 the invariant chain, can also be cotransfected with a DNA encoding a peptide having the activity 
of a B lymphocyte antigen to promote presentation of tumor associated antigens and induce 
tumor specific immunity. Thus, the induction of a T cell mediated immune response in a human 
subject may be sufficient to overcome tumor-specific tolerance in the subject. 

28 The activity of a protein of the invention may, among other means, be measured by the 

following methods: 

Suitable assays for thymocyte or splenocyte cytotoxicity include, without limitation, 
those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. 
32 H. Margulies, E. M. Shevach, W. Strober, Pub. Greene Publishing Associates and 

Wiley-Interscience (Chapter 3, In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; 
Chapter 7, Immunologic studies in Humans); Herrmann et al., Proc. Natl. Acad. Sci. USA 
78:2488-2492, 1981 ; Herrmann et al., J. Immunol. 128:1968-1974, 1982; Handa et al., J. 
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Immunol. 135:1564-1572, 1985; Takai et aL, I. ImmunoL 137:3494-3500, 1986; Takai et aL, J. 
Immunol. 140:508-512, 1988; Bowman et aL, J. Virology 61:1992-1998; Bertagnolli et al., 
Cellular Immunology 133:327-341, 1991; Brown et aL, J. Immunol. 153:3079-3092, 1994. 
4 Assays for T-cell-dependent immunoglobulin responses and isotype switching (which 

will identify, among others, proteins that modulate T-cell dependent antibody responses and that 
affect Thl/Th2 profiles) include, without limitation, those described in: Maliszewski, J. 
ImmunoL 144:3028-3033, 1990; and Assays for B cell function: In vitro antibody production, 
8 Mond, J. J. and Brunswick, M. In Current Protocols in Immunology. J. E. e.a. Coligan eds. Vol 1 
pp. 3.8.1-3.8.16, John Wiley and Sons, Toronto. 1994. 

Mixed lymphocyte reaction (MLR) assays (which will identify, among others, proteins 
that generate predominantly Thl and CTL responses) include, without limitation, those described 

12 in: Current Protocols in Immunology, Ed by J. E. Coligan, A. M. Kruisbeek, D. H. Margulies, E. 
M. Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley-Interscience (Chapter 3, 
In Vitro assays for Mouse Lymphocyte Function 3.1-3.19; Chapter 7, Immunologic studies in 
Humans); Takai et aL, J. Immunol. 137:3494-3500, 1986; Takai et al., J. ImmunoL 140:508-512, 

16 1988; Bertagnolli et al., J. Immunol. 149:3778-3783, 1992. 

Dendritic cell-dependent assays (which will identify, among others, proteins expressed 
by dendritic cells that activate naive T-cells) include, without limitation, those described in: 
Guery et aL, J. Immunol. 134:536-544, 1995; Inaba et aL, Journal of Experimental Medicine 

20 173:549-559, 1991; Macatonia et aL, Journal of Immunology 154:5071-5079, 1995; Porgador et 
aL, Journal of Experimental Medicine 182:255-260, 1995; Nair et aL, Journal of Virology 
67:4062-4069, 1993; Huang et aL, Science 264:961-965, 1994; Macatonia et aL, Journal of 
Experimental Medicine 169:1255-1264, 1989; Bhardwaj et aL, Journal of Clinical Investigation 

24 94:797-807, 1994; and Inaba et aL, Journal of Experimental Medicine 172:63 1-640, 1990. 

Assays for lymphocyte survival/apoptosis (which will identify, among others, proteins 
that prevent apoptosis after superantigen induction and proteins that regulate lymphocyte 
homeostasis) include, without limitation, those described in: Darzynkiewicz et al., Cytometry 

28 13:795-808, 1992; Gorczyca et aL, Leukemia 7:659-670, 1993; Gorczyca et al., Cancer Research 
53:1945-1951, 1993; Itoh et aL, Cell 66:233-243, 1991; Zachaxchuk, Journal of Immunology 
145:4037-4045, 1990; Zamai et al., Cytometry 14:891-897, 1993; Gorczyca et aL, International 
Journal of Oncology 1:639-648, 1992. 

32 Assays for proteins that influence early steps of T-cell commitment and development 

include, without limitation, those described in: Antica et aL, Blood 84:1 1 1-1 17, 1994; Fine et aL, 
Cellular Immunology 155:1 11-122, 1994; Galy et aL, Blood 85:2770-2778, 1995; Toki et al., 
Proc. Nat. Acad Sci. USA 88:7548-7551, 1991. 
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4.10.8 ACTIVIN/INHIBIN ACTIVITY 

A polypeptide of the present invention may also exhibit activin- or inhibin-related 
4 activities. A polynucleotide of the invention may encode a polypeptide exhibiting such 
characteristics. Inhibins are characterized by their ability to inhibit the release of follicle 
stimulating hormone (FSH), while activins and are characterized by their ability to stimulate the 
release of follicle stimulating hormone (FSH). Thus, a polypeptide of the present invention, 
8 alone or in heterodimers with a member of the inhibin family, may be useful as a contraceptive 
based on the ability of inhibins to decrease fertility in female mammals and decrease 
spermatogenesis in male mammals. Administration of sufficient amounts of other inhibins can 
induce infertility in these mammals. Alternatively, the polypeptide of the invention, as a 
12 homodimer or as a heterodimer with other protein subunits of the inhibin group, may be useful as 
a fertility inducing therapeutic, based upon the ability of activin molecules in stimulating FSH 
release from cells of the anterior pituitary. See, for example, U.S. Pat. No. 4,798,885. A 
polypeptide of the invention may also be useful for advancement of the onset of fertility in 
16 sexually immature mammals, so as to increase the lifetime reproductive performance of domestic 
animals such as, but not limited to, cows, sheep and pigs. 

The activity of a polypeptide of the invention may, among other means, be measured by 
the following methods. 

20 Assays for activin/inhibin activity include, without limitation, those described in: Vale et 

al., Endocrinology 91:562-572, 1972; Ling et al., Nature 321:779-782, 1986; Vale et al. 5 Nature 
321:776-779, 1986; Mason et al., Nature 318:659-663, 1985; Forage et aL, Proc. Natl. Acad. Sci. 
USA 83:3091-3095, 1986. 

24 

4.10.9 CHEMOTACTIC/CHEMOKINETIC ACTIVITY 

A polypeptide of the present invention may be involved in chemotactic or chemokinetic 
activity for mammalian cells, including, for example, monocytes, fibroblasts, neutrophils, 

28 T-cells, mast cells, eosinophils, epithelial and/or endothelial cells. A polynucleotide of the 

invention can encode a polypeptide exhibiting such attributes. Chemotactic and chemokinetic 
receptor activation can be used to mobilize or attract a desired cell population to a desired site of 
action. Chemotactic or chemokinetic compositions (e.g. proteins, antibodies, binding partners, or 

32 modulators of the invention) provide particular advantages in treatment of wounds and other 
trauma to tissues, as well as in treatment of localized infections. For example, attraction of 
lymphocytes, monocytes or neutrophils to tumors or sites of infection may result in improved 
immune responses against the tumor or infecting agent. 
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A protein or peptide has chemotactic activity for a particular cell population if it can 
stimulate, directly or indirectly, the directed orientation or movement of such cell population. 
Preferably, the protein or peptide has the ability to directly stimulate directed movement of cells. 
4 Whether a particular protein has chemotactic activity for a population of cells can be readily 
determined by employing such protein or peptide in any known assay for cell chemotaxis. 
Therapeutic compositions of the invention can be used in the following: 
Assays for chemotactic activity (which will identify proteins that induce or prevent 
8 chemotaxis) consist of assays that measure the ability of a protein to induce the migration of 
cells across a membrane as well as the ability of a protein to induce the adhesion of one cell 
population to another cell population. Suitable assays for movement and adhesion include, 
without limitation, those described in: Current Protocols in Immunology, Ed by J. E. Coligan, A. 
12 M. Kruisbeek, D. H. Marguiles, E. M Shevach, W. Strober, Pub. Greene Publishing Associates 
and Wiley-Interscience (Chapter 6.12, Measurement of alpha and beta Chemokines 
6.12.1-6.12.28; Taub et al. J. Clin. Invest. 95:1370-1376, 1995; Lind et al. APMIS 103:140-146, 
1995; Muller et al Eur. J. Immunol. 25:1744-1748; Gruber et al. J. of Immunol. 152:5860-5867, 
16 1994; Johnston et al. J. of Immunol. 153:1762-1768, 1994. 

4.10.10 HEMOSTATIC AND THROMBOLYTIC ACTIVITY 

A polypeptide of the invention may also be involved in hemostatis or thrombolysis or 
20 thrombosis. A polynucleotide of the invention can encode a polypeptide exhibiting such 

attributes. Compositions may be useful in treatment of various coagulation disorders (including 

hereditary disorders, such as hemophilias) or to enhance coagulation and other hemostatic events 

in treating wounds resulting from trauma, surgery or other causes. A composition of the 
24 invention may also be useful for dissolving or inhibiting formation of thromboses and for 

treatment and prevention of conditions resulting therefrom (such as, for example, infarction of 

cardiac and central nervous system vessels (e.g., stroke). 

Therapeutic compositions of the invention can be used in the following: 
28 Assay for hemostatic and thrombolytic activity include, without limitation, those 

described in: Linet et al., J. Clin. Pharmacol. 26:131-140, 1986; Burdick et al., Thrombosis Res. 

45:413-419, 1987; Humphrey et al., Fibrinolysis 5:71-79 (1991); Schaub, Prostaglandins 

35:467-474, 1988. 

32 

4.10.11 CANCER DIAGNOSIS AND THERAPY 

Polypeptides of the invention may be involved in cancer cell generation* proliferation or 
metastasis. Detection of the presence or amount of polynucleotides or polypeptides of the 
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invention may be useful for the diagnosis and/or prognosis of one or more types of cancer. For 
example, the presence or increased expression of a polynucleotide/polypeptide of the invention 
may indicate a hereditary risk of cancer, a precancerous condition, or an ongoing malignancy. 
4 Conversely, a defect in the gene or absence of the polypeptide may be associated with a cancer 
condition. Identification of single nucleotide polymorphisms associated with cancer or a 
predisposition to cancer may also be useful for diagnosis or prognosis. 

Cancer treatments promote tumor regression by inhibiting tumor cell proliferation, 
8 inhibiting angiogenesis (growth of new blood vessels that is necessary to support tumor growth) 
and/or prohibiting metastasis by reducing tumor cell motility or invasiveness. Therapeutic 
compositions of the invention may be effective in adult and pediatric oncology including in solid 
phase tumors/malignancies, locally advanced tumors, human soft tissue sarcomas, metastatic 

12 cancer, including lymphatic metastases, blood cell malignancies including multiple myeloma, 
acute and chronic leukemias, and lymphomas, head and neck cancers including mouth cancer, 
larynx cancer and thyroid cancer, lung cancers including small cell carcinoma and non-small cell 
cancers, breast cancers including small cell carcinoma and ductal carcinoma, gastrointestinal 

16 cancers including esophageal cancer, stomach cancer, colon cancer, colorectal cancer and polyps 
associated with colorectal neoplasia, pancreatic cancers,, liver cancer, urologic cancers including 
bladder cancer and prostate cancer, malignancies of the female genital tract including ovarian 
carcinoma, uterine (including endometrial) cancers, and solid tumor in the ovarian follicle, 

20 kidney cancers including renal cell carcinoma, brain cancers including intrinsic brain tumors, 
neuroblastoma, astrocytic brain tumors, gliomas, metastatic tumor cell invasion in the central 
nervous system, bone cancers including osteomas, skin cancers including malignant melanoma, 
tumor progression of human skin keratinocytes, squamous cell carcinoma, basal cell carcinoma, 

24 hemangiopericytoma and Karposi's sarcoma. 

Polypeptides, polynucleotides, or modulators of polypeptides of the invention (including 
inhibitors and stimulators of the biological activity of the polypeptide of the invention) may be 
administered to treat cancer. Therapeutic compositions can be administered in therapeutically 

28 effective dosages alone or in combination with adjuvant cancer therapy such as surgery, 

chemotherapy, radiotherapy, thermotherapy, and laser therapy, and may provide a beneficial 
effect, e.g. reducing tumor size, slowing rate of tumor growth, inhibiting metastasis, or otherwise 
improving overall clinical condition, without necessarily eradicating the cancer. 

32 The composition can also be administered in therapeutically effective amounts as a 

portion of an anti-cancer cocktail. An anti-cancer cocktail is a mixture of the polypeptide or 
modulator of the invention with one or more anti-cancer drugs in addition to a pharmaceutically 
acceptable carrier for delivery. The use of anti-cancer cocktails as a cancer treatment is routine. 
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Anti-cancer drugs that are well known in the art and can be used as a treatment in combination 
with the polypeptide or modulator of the invention include: Actinomycin D, Aminoglutethimide, 
Asparaginase, Bleomycin, Busulfan, Carboplatin, Camiustine, Chlorambucil, Cisplatin (cis- 
4 DDP), Cyclophosphamide, Cytaxabine HC1 (Cytosine arabinoside), Dacarbazine, Dactinomycin, 
Daunorubicin HC1, Doxorubicin HO, Estramustine phosphate sodium, Etoposide (V16-213), 
Floxuridine, 5-Fluorouracil (5-Fu), Flutamide, Hydroxyurea (hydroxycarbamide), Ifosfamide, 
Interferon Alpha-2a, Interferon Alpha-2b, Leuprolide acetate (LHRH-releasing factor analog), 
8 Lomustine, Mechlorethamine HC1 (nitrogen mustard), Melphalan, Mercaptopurine, Mesna, 

Methotrexate (MTX), Mitomycin, Mitoxantrone HC1, Octreotide, Plicamycin, Procarbazine HC1, 
Streptozocin, Tamoxifen citrate, Tliioguanine, Thiotepa, Vinblastine sulfate, Vincristine sulfate, 
Amsacrine, Azacitidine, Hexamethylmelamine, Interleukin-2, Mitoguazone, Pentostatin, 

12 Semustine, Teniposide, and Vindesine sulfate. 

In addition, therapeutic compositions of the invention may be used for prophylactic 
treatment of cancer. There are hereditary conditions and/or environmental situations (e.g. 
exposure to carcinogens) known in the art that predispose an individual to developing cancers. 

16 Under these circumstances, it may be beneficial to treat these individuals with therapeutically 
effective doses of the polypeptide of the invention to reduce the risk of developing cancers. 

In vitro models can be used to determine the effective doses of the polypeptide of the 
invention as a potential cancer treatment. These in vitro models include proliferation assays of 

20 cultured tumor cells, growth of cultured tumor cells in soft agar (see Freshney, (1987) Culture of 
Animal Cells: A Manual of Basic Technique, Wily-Liss, New York, NY Ch 18 and Ch 21), 
tumor systems in nude mice as described in Giovanella et al., J. Natl. Can. Inst., 52: 921-30 
(1974), mobility and invasive potential of tumor cells in Boyden Chamber assays as described in 

24 Pilkington et al., Anticancer Res., 17: 4107-9 (1997), and angiogenesis assays such as induction 
of vascularization of the chick chorioallantoic membrane or induction of vascular endothelial 
cell migration as described in Ribatta et al., Intl. J. Dev. Biol., 40: 1 189-97 (1999) and Li et al., 
Clin- Exp. Metastasis, 17:423-9 (1999), respectively. Suitable tumor cells lines are available, 

28 e.g. from American Type Tissue Culture Collection catalogs. 

4.10.12 RECEPTOR/LIGAND ACTIVITY 

A polypeptide of the present invention may also demonstrate activity as receptor, 
32 receptor ligand or inhibitor or agonist of receptor/ligand interactions. A polynucleotide of the 
invention can encode a polypeptide exhibiting such characteristics. Examples of such receptors 
and ligands include, without limitation, cytokine receptors and their ligands, receptor kinases and 
their ligands, receptor phosphatases and their ligands, receptors involved in cell-cell interactions 
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and their ligands (including without limitation, cellular adhesion molecules (such as selectins, 
integrins and their ligands) and receptor/ligand pairs involved in antigen presentation, antigen 
recognition and development of cellular and humoral immune responses. Receptors and ligands 
4 are also useful for screening of potential peptide or small molecule inhibitors of the relevant 
receptor/ligand interaction, A protein of the present invention (including, without limitation, 
fragments of receptors and ligands) may themselves be useful as inhibitors of receptor/ligand 
interactions. 

8 The activity of a polypeptide of the invention may, among other means, be measured by 

the following methods: 

Suitable assays for receptor-ligand activity include without limitation those described in: 

Current Protocols in Immunology, Ed by J. E, Coligan, A. M. Kruisbeek, D. H. Margulies, E. M. 
12 Shevach, W. Strober, Pub. Greene Publishing Associates and Wiley- Interscience (Chapter 7.28, 

Measurement of Cellular Adhesion under static conditions 7.28. 1- 7.28.22), Takai et aL, Proc. 

Natl. Acad. Sci. USA 84:6864-6868, 1987; Bierer et aL, J. Exp. Med. 168:1145-1156, 1988; 

Rosenstein et al., J. Exp. Med. 169:149-160 1989; Stoltenborg et al., J. Immunol. Methods 
16 175:59-68, 1994; Stitt et al., Cell 80:661-670, 1995. 

By way of example, the polypeptides of the invention may be used as a receptor for a 

ligand(s) thereby transmitting the biological activity of that ligand(s). Ligands may be identified 

through binding assays, affinity chromatography, dihybrid screening assays, BIAcore assays, gel 
20 overlay assays, or other methods known in the art. 

Studies characterizing drugs or proteins as agonist or antagonist or partial agonists or a 

partial antagonist require the use of other proteins as competing ligands. The polypeptides of the 

present invention or ligand(s) thereof may be labeled by being coupled to radioisotopes, 
24 colorimetric molecules or a toxin molecules by conventional methods. ("Guide to Protein 

Purification" Murray P. Deutscher (ed) Methods in Enzymology Vol. 182 (1990) Academic 

Press, Inc. San Diego). Examples of radioisotopes include, but are not limited to, tritium and 

carbon-14 . Examples of colorimetric molecules include, but are not limited to, fluorescent 
28 molecules such as fluorescamine, or rhodamine or other colorimetric molecules. Examples of 

toxins include, but are not limited, to ricin. 

4.10.13 DRUG SCREENING 

32 This invention is particularly useful for screening chemical compounds by using the 

novel polypeptides or binding fragments thereof in any of a variety of drug screening techniques. 
The polypeptides or fragments employed in such a test may either be free in solution, affixed to a 
solid support, borne on a cell surface or located intracellular!^ One method of drug screening 
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utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant 
nucleic acids expressing the polypeptide or a fragment thereof Drugs are screened against such 
transformed cells in competitive binding assays. Such cells, either in viable or fixed form, can 
4 be used for standard binding assays. One may measure, for example, the formation of 

complexes between polypeptides of the invention or fragments and the agent being tested or 
examine the diminution in complex formation between the novel polypeptides and an 
appropriate cell line, which are well known in the art. 
8 Sources for test compounds that may be screened for ability to bind to or modulate (/. e. , 

increase or decrease) the activity of polypeptides of the invention include (1) inorganic and 
organic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of either random or mimetic peptides, oligonucleotides or organic molecules. 

12 Chemical libraries may be readily synthesized or purchased from a number of 

commercial sources, and may include structural analogs of known compounds or compounds 
that are identified as "hits" or "leads" via natural product screening. 

The sources of natural product libraries are microorganisms (including bacteria and 

16 fungi), animals, plants or other vegetation, or marine organisms, and libraries of mixtures for 

screening may be created by: (1) fermentation and extraction of broths from soil, plant or marine 
microorganisms or (2) extraction of the organisms themselves. Natural product libraries include 
polyketides, non-ribosomal peptides, and (non-naturally occurring) variants thereof. For a 

20 review, see Science 252:63-68 (1 998). 

Combinatorial libraries are composed of large numbers of peptides, oligonucleotides or 
organic compounds and can be readily prepared by traditional automated synthesis methods, 
PGR, cloning or proprietary synthetic methods. Of particular interest are peptide and 

24 oligonucleotide combinatorial libraries. Still other libraries of interest include peptide, protein, 
peptidomimetic, multiparallel synthetic collection, recombinatorial, and polypeptide libraries. 
For a review of combinatorial chemistry and libraries created therefrom, see Myers, Curr. Opin. 
Biotechnol 8:701-707 (1997). For reviews and examples of peptidomimetic libraries, see 

28 Al-Obeidi et al., Mol Biotechnol 9(3):205-23 (1998); Hruby et al., Curr Opin Chem Biol, 

1(1):114-19 (1997); Dorner et al., BioorgMed Chem, 4(5):709-15 (1996) (alkylated dipeptides). 

Identification of modulators through use of the various libraries described herein permits 
modification of the candidate "hit" (or "lead") to optimize the capacity of the "hit" to bind a 

32 polypeptide of the invention. The molecules identified in the binding assay are then tested for 

antagonist or agonist activity in in vivo tissue culture or animal models that are well known in the 
art. In brief, the. molecules are titrated into a plurality of cell cultures or animals and then tested 
for either cell/animal death or prolonged survival of the animal/cells. 
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The binding molecules thus identified may be complexed with toxins, e.g., ricin or 
cholera, or with other compounds that are toxic to cells such as radioisotopes. The toxin-binding 
molecule complex is then targeted to a tumor or other cell by the specificity of the binding 
4 molecule for a polypeptide of the invention. Alternatively, the binding molecules may be 
complexed with imaging agents for targeting and imaging purposes. 

4.10.14 ASSAY FOR RECEPTOR ACTIVITY 

8 The invention also provides methods to detect specific binding of a polypeptide e.g. a 

ligand or a receptor. The art provides numerous assays particularly useful for identifying 
previously unknown binding partners for receptor polypeptides of the invention. For example, 
expression cloning using mammalian or bacterial cells, or dihybrid screening assays can be used 

12 to identify polynucleotides encoding binding partners. As another example, affinity 

chromatography with the appropriate immobilized polypeptide of the invention can be used to 
isolate polypeptides that recognize and bind polypeptides of the invention. There are a number 
of different libraries used for the identification of compounds, and in particular small molecules, 

1 6 that modulate (ie., increase or decrease) biological activity of a polypeptide of the invention. 
Ligands for receptor polypeptides of the invention can also be identified by adding exogenous 
ligands, or cocktails of ligands to two cells populations that are genetically identical except for 
the expression of the receptor of the invention: one cell population expresses the receptor of the 

20 invention whereas the other does not. The response of the two cell populations to the addition of 
ligands(s) are then compared. Alternatively, an expression library can be co-expressed with the 
polypeptide of the invention in cells and assayed for an autocrine response to identify potential 
ligand(s). As still another example, BIAcore assays, gel overlay assays, or other methods known 

24 in the art can be used to identify binding partner polypeptides, including, (1) organic and 
inorganic chemical libraries, (2) natural product libraries, and (3) combinatorial libraries 
comprised of random peptides, oligonucleotides or organic molecules. 

The role of downstream intracellular signaling molecules in the signaling cascade of the 

28 polypeptide of the invention can be determined. For example, a chimeric protein in which the 
cytoplasmic domain of the polypeptide of the invention is fused to the extracellular portion of a 
protein, whose ligand has been identified, is produced in a host cell. The cell is then incubated 
with the ligand specific for the extracellular portion of the chimeric protein, thereby activating 

32 the chimeric receptor. Known downstream proteins involved in intracellular signaling can then 
be assayed for expected modifications i.e. phosphorylation. Other methods known to those in the 
art can also be used to identify signaling molecules involved in receptor activity. 
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4.10.15 



ANTI-INFLAMMATORY ACTIVITY 



Compositions of the present invention may also exhibit anti-inflammatory activity. The 
anti -inflammatory activity may be achieved by providing a stimulus to cells involved in the 
4 inflammatory response, by inhibiting or promoting cell-cell interactions (such as, for example, 
cell adhesion), by inhibiting or promoting chemotaxis of cells involved in the inflammatory 
process, inhibiting or promoting cell extravasation, or by stimulating or suppressing production 
of other factors which more directly inhibit or promote an inflammatory response. Compositions 
8 with such activities can be used to treat inflammatory conditions including chronic or acute 
conditions), including without limitation intimation associated with infection (such as septic 
shock, sepsis or systemic inflammatory response syndrome (SIRS)), ischemia-reperfusion injury, 
endotoxin lethality, arthritis, complement-mediated hyperacute rejection, nephritis, cytokine or 

12 chemokine-induced lung injury, inflammatory bowel disease, Crohn's disease or resulting from 
over production of cytokines such as TNF or IL-1 . Compositions of the invention may also be 
useful to treat anaphylaxis and hypersensitivity to an antigenic substance or material. 
Compositions of this invention may be utilized to prevent or treat conditions such as, but not 

1 6 limited to, sepsis, acute pancreatitis, endotoxin shock, cytokine induced shock, rheumatoid 

arthritis, chronic inflammatory arthritis, pancreatic cell damage from diabetes mellitus type 1, 
graft versus host disease, inflammatory bowel disease, inflamation associated with pulmonary 
disease, other autoimmune disease or inflammatory disease, an antiproliferative agent such as for 

20 acute or chronic mylegenous leukemia or in the prevention of premature labor secondary to 
intrauterine infections. 



therapeutic that promotes or inhibits function of the polynucleotides and/or polypeptides of the 
invention. Such leukemias and related disorders include but are not limited to acute leukemia, 
acute lymphocytic leukemia, acute myelocytic leukemia, myeloblastic, promyelocyte, 
28 myelomonocytic, monocytic, erythroleukemia, chronic leukemia, chronic myelocytic 

(granulocytic) leukemia and chronic lymphocytic leukemia (for a review of such disorders, see 
Fishman et al., 1985, Medicine, 2d Ed., IB. Lippincott Co., Philadelphia). 

32 4.10.17 NERVOUS SYSTEM DISORDERS 

Nervous system disorders, involving cell types which can be tested for efficacy of 
intervention with compounds that modulate the activity of the polynucleotides and/or 
polypeptides of the invention, and which can be treated upon thus observing an indication of 



4.10.16 



LEUKEMIAS 



24 



Leukemias and related disorders may be treated or prevented by administration of a 
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therapeutic utility, include but are not limited to nervous system injuries, and diseases or 
disorders which result in either a disconnection of axons, a diminution or degeneration of 
neurons, or demyelination. Nervous system lesions which may be treated in a patient (including 
4 human and non-human mammalian patients) according to the invention include but are not 

limited to the following lesions of either the central (including spinal cord, brain) or peripheral 
nervous systems: 

(i) traumatic lesions, including lesions caused by physical injury or associated with 
8 surgery, for example, lesions which sever a portion of the nervous system, or compression 

injuries; 

(ii) ischemic lesions, in which a lack of oxygen in a portion of the nervous system 
results in neuronal injury or death, including cerebral infarction or ischemia, or spinal cord 

12 infarction or ischemia; 

(iii) infectious lesions, in which a portion of the nervous system is destroyed or 
injured as a result of infection, for example, by an abscess or associated with infection by human 
immunodeficiency virus, herpes zoster, or herpes simplex virus or with Lyme disease, 

16 tuberculosis, syphilis; 

(iv) degenerative lesions, in which a portion of the nervous system is destroyed or 
injured as a result of a degenerative process including but not limited to degeneration associated 
with Parkinson's disease, Alzheimer's disease, Huntington's chorea, or amyotrophic lateral 

20 sclerosis; 

(v) lesions associated with nutritional diseases or disorders, in which a portion of the 
nervous system is destroyed or injured by a nutritional disorder or disorder of metabolism 
including but not limited to, vitamin B12 deficiency, folic acid deficiency, Wernicke disease, 

24 tobacco-alcohol amblyopia, Marchiafava-Bignami disease (primary degeneration of the corpus 
callosum), and alcoholic cerebellar degeneration; 

(vi) neurological lesions associated with systemic diseases including but not limited to 
diabetes (diabetic neuropathy, Bell's palsy), systemic lupus erythematosus, carcinoma, or 

28 sarcoidosis; 

(vii) lesions caused by toxic substances including alcohol, lead, or particular 
neurotoxins; and 

(viii) demyelinated lesions in which a portion of the nervous system is destroyed or 
32 injured by a demyelinating disease including but not limited to multiple sclerosis, human 

immunodeficiency virus-associated myelopathy, transverse myelopathy or various etiologies, 
progressive multifocal leukoencephalopathy, and central pontine myelinolysis. 
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Therapeutics which are useful according to the invention for treatment of a nervous 
system disorder may be selected by testing for biological activity in promoting the survival or 
differentiation of neurons. For example, and not by way of limitation, therapeutics which elicit 
4 any of the following effects may be useful according to the invention: 

(i) increased survival time of neurons in culture; 

(ii) increased sprouting of neurons in culture or in vivo; 

(iii) increased production of a neuron-associated molecule in culture or in vivo, e.g., 
8 choline acetyltransferase or acetylcholinesterase with respect to motor neurons; or 

(iv) decreased symptoms of neuron dysfunction in vivo. 

Such effects may be measured by any method known in the art. In preferred, 
non-limiting embodiments, increased survival of neurons may be measured by the method set 

12 forth in Arakawa et al. (1990, J. Neurosci. 10:3507-3515); increased sprouting of neurons may 
be detected by methods set forth in Pestronk et al. (1980, Exp. Neurol. 70:65-82) or Brown et al. 
(1981, Ann. Rev. Neurosci. 4:17-42); increased production of neuron-associated molecules may 
be measured by bioassay, enzymatic assay, antibody binding, Northern blot assay, etc., 

16 depending on the molecule to be measured; and motor neuron dysfunction may be measured by 
assessing the physical manifestation of motor neuron disorder, e.g., weakness, motor neuron 
conduction velocity, or functional disability. 

In specific embodiments, motor neuron disorders that may be treated according to the 

20 invention include but are not limited to disorders such as infarction, infection, exposure to toxin, 
trauma, surgical damage, degenerative disease or malignancy that may affect motor neurons as 
well as other components of the nervous system, as well as disorders that selectively affect 
neurons such as amyotrophic lateral sclerosis, and including but not limited to progressive spinal 

24 muscular atrophy, progressive bulbar palsy, primary lateral sclerosis, infantile and juvenile 
muscular atrophy, progressive bulbar paralysis of childhood (Fazio-Londe syndrome), 
poliomyelitis and the post polio syndrome, and Hereditary Motorsensory Neuropathy 
(Charcot-Marie-Tooth Disease). 

28 

4.10.18 OTHER ACTIVITIES 

A polypeptide of the invention may also exhibit one or more of the following additional 
activities or effects: inhibiting the growth, infection or function of, or killing, infectious agents, 
32 including, without limitation, bacteria, viruses, fungi and other parasites; effecting (suppressing 
or enhancing) bodily characteristics, including, without limitation, height, weight, hair color, eye 
color, skin, fat to lean ratio or other tissue pigmentation, or organ or body part size or shape 
(such as, for example, breast augmentation or diminution, change in bone form or shape); 
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effecting biorhythms or circadian cycles or rhythms; effecting the fertility of male or female 
subjects; effecting the metabolism, catabolism, anabolism, processing, utilization, storage or 
elimination of dietary fat, lipid, protein, carbohydrate, vitamins, minerals, co-factors or other 

4 nutritional factors or component(s); effecting behavioral characteristics, including, without 
limitation, appetite, libido, stress, cognition (including cognitive disorders), depression 
(including depressive disorders) and violent behaviors; providing analgesic effects or other pain 
reducing effects; promoting differentiation and growth of embryonic stem cells in lineages other 

8 than hematopoietic lineages; hormonal or endocrine activity; in the case of enzymes, correcting 
deficiencies of the enzyme and treating deficiency-related diseases; treatment of 
hyperproliferative disorders (such as, for example, psoriasis); immunoglobulin-like activity (such 
as, for example, the ability to bind antigens or complement); and the ability to act as an antigen 
12 in a vaccine composition to raise an immune response against such protein or another material or 
entity which is cross-reactive with such protein. 

4.10.19 IDENTIFICATION OF POLYMORPHISMS 

16 The demonstration of polymorphisms makes possible the identification of such 

polymorphisms in human subjects and the pharmacogenetic use of this information for diagnosis 
and treatment. Such polymorphisms may be associated with, e.g., differential predisposition or 
susceptibility to various disease states (such as disorders involving inflammation or immune 

20 response) or a differential response to drug administration, and this genetic information can be 
used to tailor preventive or therapeutic treatment appropriately. For example, the existence of a 
polymorphism associated with a predisposition to inflammation or autoimmune disease makes 
possible the diagnosis of this condition in humans by identifying the presence of the 

24 polymorphism. 

Polymorphisms can be identified in a variety of ways known in the art which all 
generally involve obtaining a sample from a patient, analyzing DNA from the sample, optionally 
involving isolation or amplification of the DNA, and identifying the presence of the 

28 polymorphism in the DNA. For example, PCR may be used to amplify an appropriate fragment 
of genomic DNA which may then be sequenced. Alternatively, the DNA may be subjected to 
allele-specific oligonucleotide hybridization (in which appropriate oligonucleotides are 
hybridized to the DNA under conditions permitting detection of a single base mismatch) or to a 

32 single nucleotide extension assay (in which an oligonucleotide that hybridizes immediately 

adjacent to the position of the polymorphism is extended with one or more labeled nucleotides). 
In addition, traditional restriction fragment length polymorphism analysis (using restriction 
enzymes that provide differential digestion of the genomic DNA depending on the presence or 
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absence of the polymorphism) may be performed. Arrays with nucleotide sequences of the 
present invention can be used to detect polymorphisms. The array can comprise modified 
nucleotide sequences of the present invention in order to detect the nucleotide sequences of the 

4 present invention. In the alternative, any one of the nucleotide sequences of the present 
invention can be placed on the array to detect changes from those sequences. 

Alternatively a polymorphism resulting in a change in the amino acid sequence could 
also be detected by detecting a corresponding change in amino acid sequence of the protein, e.g., 

8 by an antibody specific to the variant sequence. 

4.10.20 ARTHRITIS AND INFLAMMATION 

The immunosuppressive effects of the compositions of the invention against rheumatoid 

12 arthritis is determined in an experimental animal model system. The experimental model system 
is adjuvant induced arthritis in rats, and the protocol is described by J. Holoshitz, et at., 1 983, 
Science, 219:56, or by B. Waksman et ah, 1963, Int. Arch. Allergy AppL Immunol., 23:129. 
Induction of the disease can be caused by a single injection, generally intradermally, of a 

16 suspension of killed Mycobacterium tuberculosis in complete Freund's adjuvant (CFA). The 
route of injection can vary, but rats may be injected at the base of the tail with an adjuvant 
mixture. The polypeptide is administered in phosphate buffered solution (PBS) at a dose of about 
1-5 mg/kg. The control consists of administering PBS only. 

20 The procedure for testing the effects of the test compound would consist of intradermally 

injecting killed Mycobacterium tuberculosis in CFA followed by immediately administering the 
test compound and subsequent treatment every other day until day 24. At 14, 15, 18, 20, 22, and 
24 days after injection of Mycobacterium CFA, an overall arthritis score may be obtained as 

24 described by J. Holoskitz above. An analysis of the data would reveal that the test compound 
would have a dramatic affect on the swelling of the joints as measured by a decrease of the 
arthritis score. 

28 4.11 THERAPEUTIC METHODS 

The compositions (including polypeptide fragments, analogs, variants and antibodies or 
other binding partners or modulators including antisense polynucleotides) of the invention have 
numerous applications in a variety of therapeutic methods. Examples of therapeutic applications 
32 include, but are not limited to, those exemplified herein. 

4.11-1 EXAMPLE 
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One embodiment of the invention is the administration of an effective amount of the 
polypeptides or other composition of the invention to individuals affected by a disease or 
disorder that can be modulated by regulating the peptides of the invention. While the mode of 
4 administration is not particularly important, parenteral administration is preferred. An 
exemplary mode of administration is to deliver an intravenous bolus. The dosage of the 
polypeptides or other composition of the invention will normally be determined by the 
prescribing physician. It is to be expected that the dosage will vary according to the age, weight, 
8 condition and response of the individual patient. Typically, the amount of polypeptide 

administered per dose will be in the range of about O.Olj-ig/kg to 100 mg/kg of body weight, with 
the preferred dose being about 0. 1 (J-g/kg to 1 0 mg/kg of patient body weight. For parenteral 
administration, polypeptides of the invention will be formulated in an injectable form combined 

12 with a pharmaceutically acceptable parenteral vehicle. Such vehicles are well known in the art 
and examples include water, saline, Ringer's solution, dextrose solution, and solutions consisting 
of small amounts of the human serum albumin. The vehicle may contain minor amounts of 
additives that maintain the isotonicity and stability of the polypeptide or other active ingredient. 

16 The preparation of such solutions is within the skill of the art. 

4 .12 PHARMACEUTICAL FORMULATIONS AND ROUTES OF 
ADMINISTRATION 

20 A protein or other composition of the present invention (from whatever source derived, 

including without limitation from recombinant and non-recombinant sources and including 
antibodies and other binding partners of the polypeptides of the invention) may be administered 
to a patient in need, by itself, or in pharmaceutical compositions where it is mixed with suitable 

24 carriers or excipient(s) at doses to treat or ameliorate a variety of disorders. Such a composition 
may optionally contain (in addition to protein or other active ingredient and a carrier) diluents, 
fillers, salts, buffers, stabilizers, solubilizers, and other materials well known in the art. The term 
"pharmaceutically acceptable" means a non-toxic material that does not interfere with the 

28 effectiveness of the biological activity of the active ingredient(s). The characteristics of the 
carrier will depend on the route of administration. The pharmaceutical composition of the 
invention may also contain cytokines, lymphokines, or other hematopoietic factors such as 
M-CSF, GM-CSF, TNF, IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-1 1, IL-1 2, 

32 IL-13, IL-14, IL-15, IFN, TNF0, TNF1, TNF2, G-CSF, Meg-CSF, thrombopoietin, stem cell 
factor, and erythropoietin. In further compositions, proteins of the invention may be combined 
with other agents beneficial to the treatment of the disease or disorder in question. These agents 
include various growth factors such as epidermal growth factor (EGF), platelet-derived growth 
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factor (PDGF), transforming growth factors (TGF-a and TGF-|3), insulin-like growth factor 
(IGF), as well as cytokines described herein. 

The pharmaceutical composition may further contain other agents which either enhance 
4 the activity of the protein or other active ingredient or complement its activity or use in 
treatment. Such additional factors and/or agents may be included in the pharmaceutical 
composition to produce a synergistic effect with protein or other active ingredient of the 
invention, or to minimize side effects. Conversely, protein or other active ingredient of the 
8 present invention may be included in formulations of the particular clotting factor, cytokine, 
lymphokine, other hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti- 
inflammatory agent to minimize side effects of the clotting factor, cytokine, lymphokine, other 
hematopoietic factor, thrombolytic or anti-thrombotic factor, or anti-inflammatory agent (such as 

12 IL-IRa, IL-1 Hyl, IL-1 Hy2, anti-TNF, corticosteroids, immunosuppressive agents). A protein 
of the present invention may be active in multimers (e.g., heterodimers or homodimers) or 
complexes with itself or other proteins. As a result, pharmaceutical compositions of the 
invention may comprise a protein of the invention in such multimeric or complexed form. 

16 As an alternative to being included in a pharmaceutical composition of the invention 

including a first protein, a second protein or a therapeutic agent may be concurrently 
administered with the first protein (e.g., at the same time, or at differing times provided that 
therapeutic concentrations of the combination of agents is achieved at the treatment site). 

20 Techniques for formulation and administration of the compounds of the instant application may 
be found in "Remington's Pharmaceutical Sciences," Mack Publishing Co., Easton, PA, latest 
edition. A therapeutically effective dose further refers to that amount of the compound sufficient 
to result in amelioration of symptoms, e.g., treatment, healing, prevention or amelioration of the 

24 relevant medical condition, or an increase in rate of treatment, healing, prevention or 

amelioration of such conditions. When applied to an individual active ingredient, administered 
alone, a therapeutically effective dose refers to that ingredient alone. When applied to a 
combination, a therapeutically effective dose refers to combined amounts of the active 

28 ingredients that result in the therapeutic effect, whether administered in combination, serially or 
simultaneously. 

In practicing the method of treatment or use of the present invention, a therapeutically 
effective amount of protein or other active ingredient of the present invention is administered to 
32 a mammal having a condition to be treated. Protein or other active ingredient of the present 

invention may be administered in accordance with the method of the invention either alone or in 
combination with other therapies such as treatments employing cytokines, lymphokines or other 
hematopoietic factors. When co- administered with one or more cytokines, lymphokines or other 
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hematopoietic factors, protein or other active ingredient of the present invention may be 
administered either simultaneously with the cytokine(s), lymphokine(s), other hematopoietic 
factor(s), thrombolytic or anti-thrombotic factors, or sequentially. If administered sequentially, 
4 the attending physician will decide on the appropriate sequence of administering protein or other 
active ingredient of the present invention in combination with cytokine(s), lymphokine(s), other 
hematopoietic factor(s), thrombolytic or anti-thrombotic factors. 

8 4.12.1 ROUTES OF ADMINISTRATION 

Suitable routes of administration may , for example, include oral, rectal, transmucosal, or 
intestinal administration; parenteral delivery, including intramuscular, subcutaneous, 
intramedullary injections, as well as intrathecal, direct intraventricular, intravenous, 

12 intraperitoneal, intranasal, or intraocular injections. Administration of protein or other active 
ingredient of the present invention used in the pharmaceutical composition or to practice the 
method of the present invention can be carried out in a variety of conventional ways, such as oral 
ingestion, inhalation, topical application or cutaneous, subcutaneous, intraperitoneal, parenteral 

16 or intravenous injection. Intravenous administration to the patient is preferred. 

Alternately, one may administer the compound in a local rather than systemic manner, for 
example, via injection of the compound directly into a arthritic joints or in fibrotic tissue, often 
in a depot or sustained release formulation. In order to prevent the scarring process frequently 

20 occurring as complication of glaucoma surgery, the compounds may be administered topically, 
, for example, as eye drops. Furthermore, one may administer the drug in a targeted drug delivery 
system, for example, in a liposome coated with a specific antibody, targeting, for example, 
arthritic or fibrotic tissue. The liposomes will be targeted to and taken up selectively by the 

24 afflicted tissue. 

The polypeptides of the invention are administered by any route that delivers an effective 
dosage to the desired site of action. The determination of a suitable route of administration and 
an effective dosage for a particular indication is within the level of skill in the art. Preferably for 
28 wound treatment, one administers the therapeutic compound directly to the site. Suitable dosage 
ranges for the polypeptides of the invention can be extrapolated from these dosages or from 
similar studies in appropriate animal models. Dosages can then be adjusted as necessary by the 
clinician to provide maximal therapeutic benefit 

32 

4.12.2 COMPOSITIONS/FORMULATIONS 

Pharmaceutical compositions for use in accordance with the present invention thus may 
be formulated in a conventional manner using one or more physiologically acceptable carriers 
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comprising excipients and auxiliaries which facilitate processing of the active compounds into 
preparations which can be used pharmaceutically. These pharmaceutical compositions may be 
manufactured in a manner that is itself known, e.g. , by means of conventional mixing, 
4 dissolving, granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping or 
lyophilizing processes. Proper formulation is dependent upon the route of administration 
chosen. When a therapeutically effective amount of protein or other active ingredient of the 
present invention is administered orally, protein or other active ingredient of the present 
8 invention will be in the form of a tablet, capsule, powder, solution or elixir. When administered 
in tablet form, the pharmaceutical composition of the invention may additionally contain a solid 
carrier such as a gelatin or an adjuvant. The tablet, capsule, and powder contain from about 5 to 
95% protein or other active ingredient of the present invention, and preferably from about 25 to 

12 90% protein or other active ingredient of the present invention. When administered in liquid 
form, a liquid carrier such as water, petroleum, oils of animal or plant origin such as peanut oil, 
mineral oil, soybean oil, or sesame oil, or synthetic oils may be added. The liquid form of the 
pharmaceutical composition may further contain physiological saline solution, dextrose or other 

16 saccharide solution, or glycols such as ethylene glycol, propylene glycol or polyethylene glycol. 
When administered in liquid form, the pharmaceutical composition contains from about 0,5 to 
90% by weight of protein or other active ingredient of the present invention, and preferably from 
about 1 to 50% protein or other active ingredient of the present invention. 

20 When a therapeutically effective amount of protein or other active ingredient of the 

present invention is administered by intravenous, cutaneous or subcutaneous injection, protein or 
other active ingredient of the present invention will be in the form of a pyrogen-free, parenterally 
acceptable aqueous solution. The preparation of such parenterally acceptable protein or other 

24 active ingredient solutions, having due regard to pH, isotonicity, stability, and the like, is within 
the skill in the art. A preferred pharmaceutical composition for intravenous, cutaneous, or 
subcutaneous injection should contain, in addition to protein or other active ingredient of the 
present invention, an isotonic vehicle such as Sodium Chloride Injection, Ringer's Injection, 

28 Dextrose Injection, Dextrose and Sodium Chloride Injection, Lactated Ringer's Injection, or 

other vehicle as known in the art. The pharmaceutical composition of the present invention may 
also contain stabilizers, preservatives, buffers, antioxidants, or other additives known to those of 
skill in the art. For injection, the agents of the invention may be formulated in aqueous 

32 solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's 

solution, or physiological saline buffer. For transmucosal administration, penetrants appropriate 
to the barrier to be permeated are used in the formulation. Such penetrants are generally known 
in the art. 
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For oral administration, the compounds can be formulated readily by combining the 
active compounds with pharmaceutically acceptable carriers well known in the art. Such carriers 
enable the compounds of the invention to be formulated as tablets, pills, dragees, capsules, 
4 liquids, gels, syrups, slurries, suspensions and the like, for oral ingestion by a patient to be 
treated. Pharmaceutical preparations for oral use can be obtained from a solid excipient, 
optionally grinding a resulting mixture, and processing the mixture of granules, after adding 
suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in 
8 particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose 

preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, 
gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium 
carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, disintegrating agents 

12 may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt 
thereof such as sodium alginate. Dragee cores are provided with suitable coatings. For this 
purpose, concentrated sugar solutions may be used, which may optionally contain gum arabic, 
talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 

16 solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be 

added to the tablets or dragee coatings for identification or to characterize different combinations 
of active compound doses. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 

20 gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 

sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as 
lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in 

24 suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 

stabilizers may be added. All formulations for oral administration should be in dosages suitable 
for such administration. For buccal administration, the compositions may take the form of 
tablets or lozenges formulated in conventional manner. 

28 For administration by inhalation, the compounds for use according to the present 

invention are conveniently delivered in the form of an aerosol spray presentation from 
pressurized packs or a nebuliser, with the use of a suitable propellant, e.g. , 
dichlorodifluoromethane, trichlorofluoromethane, dichlorotetrafluoroethane, carbon dioxide or 

32 other suitable gas. In the case of a pressurized aerosol the dosage unit may be determined by 
providing a valve to deliver a metered amount. Capsules and cartridges of, e.g., gelatin for use 
in an inhaler or insufflator may be formulated containing a powder mix of the compound and a 
suitable powder base such as lactose or starch. The compounds may be formulated for parenteral 
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administration by injection, e.g., by bolus injection or continuous infusion. Formulations for 
injection may be presented in unit dosage form, e.g., in ampules or in multi-dose containers, with 
an added preservative. The compositions may take such forms as suspensions, solutions or 
4 emulsions in oily or aqueous vehicles, and may contain formulatory agents such as suspending, 
stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions of 
the active compounds in water-soluble form. Additionally, suspensions of the active compounds 
8 may be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or 

vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or 
triglycerides, or liposomes. Aqueous injection suspensions may contain substances which 
increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 

12 dextran. Optionally, the suspension may also contain suitable stabilizers or agents which 
increase the solubility of the compounds to allow for the preparation of highly concentrated 
solutions. Alternatively, the active ingredient may be in powder form for constitution with a 
suitable vehicle, e.g., sterile pyrogen-free water, before use. 

16 The compounds may also be formulated in rectal compositions such as suppositories or 

retention enemas, e.g., containing conventional suppository bases such as cocoa butter or other 
glycerides. In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 

20 implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 
Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as 
sparingly soluble derivatives, for example, as a sparingly soluble salt. 

24 A pharmaceutical carrier for the hydrophobic compounds of the invention is a co-solvent 

system comprising benzyl alcohol, a nonpolar surfactant, a water-miscible organic polymer, and 
an aqueous phase. The co-solvent system may be the VPD co-solvent system. VPD is a solution 
of 3% w/v benzyl alcohol, 8% w/v of the nonpolar surfactant polysorbate 80, and 65% w/v 

28 polyethylene glycol 300, made up to volume in absolute ethanol. The VPD co-solvent system 
(VPD:5W) consists of VPD diluted 1:1 with a 5% dextrose in water solution. This co-solvent 
system dissolves hydrophobic compounds well, and itself produces low toxicity upon systemic 
administration. Naturally, the proportions of a co-solvent system may be varied considerably 

32 without destroying its solubility and toxicity characteristics. Furthermore, the identity of the 

co-solvent components may be varied: for example, other low-toxicity nonpolar surfactants may 
be used instead of polysorbate 80; the fraction size of polyethylene glycol may be varied; other 
biocompatible polymers may replace polyethylene glycol, e.g. polyvinyl pyrrolidone; and other 
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sugars or polysaccharides may substitute for dextrose. Alternatively, other delivery systems for 
hydrophobic pharmaceutical compounds may be employed. Liposomes and emulsions are well 
known examples of delivery vehicles or carriers for hydrophobic drugs. Certain organic solvents 
4 such as dimethylsulfoxide also may be employed, although usually at the cost of greater toxicity. 
Additionally, the compounds may be delivered using a sustained-release system, such as 
semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. 
Various types of sustained-release materials have been established and are well known by those 
8 skilled in the art. Sustained-release capsules may, depending on their chemical nature, release the 
compounds for a few weeks up to over 100 days. Depending on the chemical nature and the 
biological stability of the therapeutic reagent, additional strategies for protein or other active 
ingredient stabilization may be employed. 

12 The pharmaceutical compositions also may comprise suitable solid or gel phase carriers 

or excipients. Examples of such carriers or excipients include but are not limited to calcium 
carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and 
polymers such as polyethylene glycols. Many of the active ingredients of the invention may be 

16 provided as salts with pharmaceutically compatible counter ions. Such pharmaceutically 
acceptable base addition salts are those salts which retain the biological effectiveness and 
properties of the free acids and which are obtained by reaction with inorganic or organic bases 
such as sodium hydroxide, magnesium hydroxide, ammonia, trialkylamine, dialkylamine, 

20 monoalkylamine, dibasic amino acids, sodium acetate, potassium benzoate, triethanol amine and 
the like. 

The pharmaceutical composition of the invention may be in the form of a complex of the 
protein(s) or other active ingredient(s) of present invention along with protein or peptide 

24 antigens. The protein and/or peptide antigen will deliver a stimulatory signal to both B and T 
lymphocytes. B lymphocytes will respond to antigen through their surface immunoglobulin 
receptor. T lymphocytes will respond to antigen through the T cell receptor (TCR) following 
presentation of the antigen by MHC proteins. MHC and structurally related proteins including 

28 those encoded by class I and class II MHC genes on host cells will serve to present the peptide 
antigen(s) to T lymphocytes. The antigen components could also be supplied as purified 
MHC-peptide complexes alone or with co-stimulatory molecules that can directly signal T cells. 
Alternatively antibodies able to bind surface immunoglobulin and other molecules on B cells as 

32 well as antibodies able to bind the TCR and other molecules on T cells can be combined with the 
pharmaceutical composition of the invention. 

The pharmaceutical composition of the invention may be in the form of a liposome in 
which protein of the present invention is combined, in addition to other pharmaceutically 
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acceptable carriers, with amphipathic agents such as lipids which exist in aggregated form as 
micelles, insoluble monolayers, liquid crystals, or lamellar layers in aqueous solution. Suitable 
lipids for liposomal formulation include, without limitation, monoglycerides, diglycerides, 
4 sulfatides, lysolecithins, phospholipids, saponin, bile acids, and the like. Preparation of such 
liposomal formulations is within the level of skill in the art, as disclosed, for example, in U.S. 
Patent Nos. 4,235,871; 4,501,728; 4,837,028; and 4,737,323, all of which are incorporated 
herein by reference. 

8 The amount of protein or other active ingredient of the present invention in the 

pharmaceutical composition of the present invention will depend upon the nature and severity of 
the condition being treated, and on the nature of prior treatments which the patient has 
undergone. Ultimately, the attending physician will decide the amount of protein or other active 

12 ingredient of the present invention with which to treat each individual patient. Initially, the 

attending physician will administer low doses of protein or other active ingredient of the present 
invention and observe the patient's response. Larger doses of protein or other active ingredient 
of the present invention may be administered until the optimal therapeutic effect is obtained for 

16 the patient, and at that point the dosage is not increased further. It is contemplated that the 

various pharmaceutical compositions used to practice the method of the present invention should 
contain about 0.01 jug to about 100 mg (preferably about 0.1 p.g to about 10 mg, more preferably 
about 0. 1 fig to about 1 mg) of protein or other active ingredient of the present invention per kg 

20 body weight. For compositions of the present invention which are useful for bone, cartilage, 

tendon or ligament regeneration, the therapeutic method includes administering the composition 
topically, systematically, or locally as an implant or device. When administered, the therapeutic 
composition for use in this invention is, of course, in a pyrogen-free, physiologically acceptable 

24 form. Further, the composition may desirably be encapsulated or injected in a viscous form for 
delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable 
for wound healing and tissue repair. Therapeutically useful agents other than a protein or other 
active ingredient of the invention which may also optionally be included in the composition as 

28 described above, may alternatively or additionally, be administered simultaneously or 

sequentially with the composition in the methods of the invention. Preferably for bone and/or 
cartilage formation, the composition would include a matrix capable of delivering the 
protein-containing or other active ingredient-containing composition to the site of bone and/or 

32 cartilage damage, providing a structure for the developing bone and cartilage and optimally 

capable of being resorbed into the body. Such matrices may be formed of materials presently in 
use for other implanted medical applications. 
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The choice of matrix material is based on biocompatibility, biodegradability, mechanical 
properties, cosmetic appearance and interface properties. The particular application of the 
compositions will define the appropriate formulation. Potential matrices for the compositions 
4 may be biodegradable and chemically defined calcium sulfate, tricalcium phosphate, 

hydroxyapatite, polylactic acid, polyglycolic acid and polyanhydrides. Other potential materials 
are biodegradable and biologically well-defined, such as bone or dermal collagen. Further 
matrices are comprised of pure proteins or extracellular matrix components. Other potential 
8 matrices are nonbiodegradable and chemically defined, such as sintered hydroxyapatite, bioglass, 
aluminates, or other ceramics. Matrices may be comprised of combinations of any of the above 
mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and 
tricalcium phosphate. The bioceramics may be altered in composition, such as in 

12 calcium-aluminate-phosphate and processing to alter pore size, particle size, particle shape, and 
biodegradability. Presently preferred is a 50:50 (mole weight) copolymer of lactic acid and 
gly colic acid in the form of porous particles having diameters ranging from 1 50 to 800 microns. 
In some applications, it will be useful to utilize a sequestering agent, such as carboxymethyl 

16 cellulose or autologous blood clot, to prevent the protein compositions from disassociating from 
the matrix. 

A preferred family of sequestering agents is cellulosic materials such as alkylcelluloses 
(including hydroxyalkylcelluloses), including methylcellulose, ethylcellulose, 

20 hydroxyethyicellulose, hydroxypropylcellulose, hydroxypropyl-methylcellulose, and 

carboxymethylcellulose, the most preferred being cationic salts of carboxymethylcellulose 
(CMC). Other preferred sequestering agents include hyaluronic acid, sodium alginate, 
poly(ethylene glycol), polyoxyethylene oxide, carboxyvinyl polymer and poly(vinyl alcohol). 

24 The amount of sequestering agent useful herein is 0.5-20 wt % ? preferably 1-10 wt % based on 
total formulation weight, which represents the amount necessary to prevent desorption of the 
protein from the polymer matrix and to provide appropriate handling of the composition, yet not 
so much that the progenitor cells are prevented from infiltrating the matrix, thereby providing the 

28 protein the opportunity to assist the osteogenic activity of the progenitor cells. In further 

compositions, proteins or other active ingredients of the invention may be combined with other 
agents beneficial to the treatment of the bone and/or cartilage defect, wound, or tissue in 
question. These agents include various growth factors such as epidermal growth factor (EGF), 

32 platelet derived growth factor (PDGF), transforming growth factors (TGF-a and TGF-|3), and 
insulin-like growth factor (IGF). 

The therapeutic compositions are also presently valuable for veterinary applications. 
Particularly domestic animals and thoroughbred horses, in addition to humans, are desired 
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patients for such treatment with proteins or other active ingredients of the present invention. The 
dosage regimen of a protein-containing pharmaceutical composition to be used in tissue 
regeneration will be determined by the attending physician considering various factors which 
4 modify the action of the proteins, e.g., amount of tissue weight desired to be formed, the site of 
damage, the condition of the damaged tissue, the size of a wound, type of damaged tissue (e.g., 
bone), the patient's age, sex, and diet, the severity of any infection, time of administration and 
other clinical factors. The dosage may vary with the type of matrix used in the reconstitution 
8 and with inclusion of other proteins in the pharmaceutical composition. For example, the 

addition of other known growth factors, such as IGF I (insulin like growth factor I), to the final 
composition, may also effect the dosage. Progress can be monitored by periodic assessment of 
tissue/bone growth and/or repair, for example, X-rays, histomorphometric determinations and 

12 tetracycline labeling. 

Polynucleotides of the present invention can also be used for gene therapy. Such 
polynucleotides can be introduced either in vivo or ex vivo into cells for expression in a 
mammalian subject. Polynucleotides of the invention may also be administered by other known 

16 methods for introduction of nucleic acid into a cell or organism (including, without limitation, in 
the form of viral vectors or naked DNA). Cells may also be cultured ex vivo in the presence of 
proteins of the present invention in order to proliferate or to produce a desired effect on or 
activity in such cells. Treated cells can then be introduced in vivo for therapeutic purposes. 

20 

4.12.3 EFFECTIVE DOSAGE 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve its 

24 intended purpose. More specifically, a therapeutically effective amount means an amount 

effective to prevent development of or to alleviate the existing symptoms of the subject being 
treated. Determination of the effective amount is well within the capability of those skilled in 
the art, especially in light of the detailed disclosure provided herein. For any compound used in 

28 the method of the invention, the therapeutically effective dose can be estimated initially from 

appropriate in vitro assays. For example, a dose can be formulated in animal models to achieve a 
circulating concentration range that can be used to more accurately determine useful doses in 
humans. For example, a dose can be formulated in animal models to achieve a circulating 

32 concentration range that includes the IC 50 as determined in cell culture (i.e., the concentration of 
the test compound which achieves a half-maximal inhibition of the protein's biological activity). 
Such information can be used to more accurately determine useful doses in humans. 
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A therapeutically effective dose refers to that amount of the compound that results in 
amelioration of symptoms or a prolongation of survival in a patient. Toxicity and therapeutic 
efficacy of such compounds can be determined by standard pharmaceutical procedures in cell 
4 cultures or experimental animals, e.g., for determining the LD50 (the dose lethal to 50% of the 
population) and the ED 5 o (the dose therapeutically effective in 50% of the population). The dose 
ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the 
ratio between LD50 and ED50. Compounds which exhibit high therapeutic indices are preferred. 
8 The data obtained from these cell culture assays and animal studies can be used in formulating a 
range of dosage for use in human. The dosage of such compounds lies preferably within a range 
of circulating concentrations that include the ED50 with little or no toxicity. The dosage may 
vary within this range depending upon the dosage foam employed and the route of administration 

1 2 utilized. The exact formulation, route of administration and dosage can be chosen by the . 
individual physician in view of the patient's condition. See, e.g., Fingl et al., 1975, in "The 
Pharmacological Basis of Therapeutics", Ch. 1 p.l. Dosage amount and interval may be adjusted 
individually to provide plasma levels of the active moiety which are sufficient to maintain the 

16 desired effects, or minimal effective concentration (MEC). The MEC will vary for each 

compound but can be estimated from in vitro data. Dosages necessary to achieve the MEC will 
depend on individual characteristics and route of administration. However, HPLC assays or 
bioassays can be used to determine plasma concentrations. 

20 Dosage intervals can also be determined using MEC value. Compounds should be 

administered using a regimen which maintains plasma levels above the MEC for 10-90% of the 
time, preferably between 30-90% and most preferably between 50-90%. In cases of local 
administration or selective uptake, the effective local concentration of the drug may not be 

24 related to plasma concentration. 

An exemplary dosage regimen for polypeptides or other compositions of the invention 
will be in the range of about 0.01 jxg/kg to 100 mg/kg of body weight daily, with the preferred 
dose being about 0. 1 pg/kg to 25 mg/kg of patient body weight daily, varying in adults and 

28 children. Dosing may be once daily, or equivalent doses may be delivered at longer or shorter 
intervals. 

The amount of composition administered will, of course, be dependent on the subject 
being treated, on the subject's age and weight, the severity of the affliction, the manner of 
32 administration and the judgment of the prescribing physician. 

4.12.4 PACKAGING 
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The compositions may, if desired, be presented in a pack or dispenser device which may 
contain one or more unit dosage forms containing the active ingredient. The pack may, for 
example, comprise metal or plastic foil, such as a blister pack. The pack or dispenser device may 
4 be accompanied by instructions for administration. Compositions comprising a compound of the 
invention formulated in a compatible pharmaceutical carrier may also be prepared, placed in an 
appropriate container, and labeled for treatment of an indicated condition. 

8 4.13 ANTIBODIES 

Also included in the invention are antibodies to proteins, or fragments of proteins of the 
invention. The term "antibody" as used herein refers to immunoglobulin molecules and 
immunologically active portions of immunoglobulin (Ig) molecules, i.e, 7 molecules that contain 

12 an antigen binding site that specifically binds (immunoreacts with) an antigen. Such antibodies 
include, but are not limited to, polyclonal, monoclonal, chimeric, single chain, F a b, F a b' and F( a b')2 
fragments, and an F a b expression library. In general, an antibody molecule obtained from 
humans relates to any of the classes IgG, IgM, IgA, IgE and IgD, which differ from one another 

16 by the nature of the heavy chain present in the molecule. Certain classes have subclasses as well, 
such as IgGj, IgG2, and others. Furthermore, in humans, the light chain may be a kappa chain or 
a lambda chain. Reference herein to antibodies includes a reference to all such classes, 
subclasses and types of human antibody species. 

20 An isolated related protein of the invention may be intended to serve as an antigen, or a 

portion or fragment thereof, and additionally can be used as an immunogen to generate 
antibodies that immunospecifically bind the antigen, using standard techniques for polyclonal 
and monoclonal antibody preparation. The full-length protein can be used or, alternatively, the 

24 invention provides antigenic peptide fragments of the antigen for use as immunogens. An 

antigenic peptide fragment comprises at least 6 amino acid residues of the amino acid sequence 
of the full length protein, (for example the amino acid sequence shown in SEQ ID NO: 1351), 
and encompasses an epitope thereof such that an antibody raised against the peptide forms a 

28 specific immune complex with the full length protein or with any fragment that contains the 

epitope. Preferably, the antigenic peptide comprises at least 10 amino acid residues, or at least 
1 5 amino acid residues, or at least 20 amino acid residues, or at least 30 amino acid residues. 
Preferred epitopes encompassed by the antigenic peptide are regions of the protein that are 

32 located on its surface; commonly these are hydrophilic regions. 

In certain embodiments of the invention, at least one epitope encompassed by the 
antigenic peptide is a region of -related protein that is located on the surface of the protein, e.g., a 
hydrophilic region. A hydrophobicity analysis of the human related protein sequence will 
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indicate which regions of a related protein are particularly hydrophilic and, therefore, are likely 
to encode surface residues useful for targeting antibody production. As a means for targeting 
antibody production, hydropathy plots showing regions of hydrophilicity and hydrophobicity 
4 may be generated by any method well known in the art, including, for example, the Kyte 

Doolittle or the Hopp Woods methods, either with or without Fourier transformation. See, e.g., 
Hopp and Woods, 1981, Proc. Nat. Acad. Sci USA 78: 3824-3828; Kyte and Doolittle 1982, J. 
Mol. Biol. 157: 105-142, each of which is incorporated herein by reference in its entirety. 
8 Antibodies that are specific for one or more domains within an antigenic protein, or derivatives, 
fragments, analogs or homologs thereof, are also provided herein. 

A protein of the invention, or a derivative, fragment, analog, homolog or ortholog 
thereof, may be utilized as an immunogen in the generation of antibodies that 
12 immunospecifically bind these protein components. 

Various procedures known within the art may be used for the production of polyclonal or 
monoclonal antibodies directed against a protein of the invention, or against derivatives, 
fragments, analogs homologs or orthologs thereof (see, for example, Antibodies: A Laboratory 
16 Manual, Harlow E, and Lane D, 1988, Cold Spring Harbor Laboratory Press, Cold Spring 

Harbor, NY, incorporated herein by reference). Some of these antibodies are discussed below. 

5.13.1 Polyclonal Antibodies 

20 For the production of polyclonal antibodies, various suitable host animals {e.g., rabbit, 

goat, mouse or other mammal) may be immunized by one or more injections with the native 
protein, a synthetic variant thereof, or a derivative of the foregoing. An appropriate 
immunogenic preparation can contain, for example, the naturally occurring immunogenic 

24 protein, a chemically synthesized polypeptide representing the immunogenic protein, or a 

recombinantly expressed immunogenic protein. Furthermore, the protein may be conjugated to 
a second protein known to be immunogenic in the mammal being immunized. Examples of such 
immunogenic proteins include but are not limited to keyhole limpet hemocyanin, serum albumin, 

28 bovine thyroglobulin, and soybean trypsin inhibitor. The preparation can further include an 
adjuvant. Various adjuvants used to increase the ^immunological response include, but are not 
limited to, Freund's (complete and incomplete), mineral gels (e.g., aluminum hydroxide), surface 
active substances (e.g., lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, 

32 dinitrophenol, etc.), adjuvants usable in humans such as Bacille Calmette-Guerin and 

Corynebacterium parvum, or similar immunostimulatory agents. Additional examples of 
adjuvants which can be employed include MPL-TDM adjuvant (monophosphoryl Lipid A, 
synthetic trehalose dicorynomycolate). 
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The polyclonal antibody molecules directed against the immunogenic protein can be 
isolated from the mammal (e.g., from the blood) and further purified by well known techniques, 
such as affinity chromatography using protein A or protein G ? which provide primarily the IgG 

4 fraction of immune serum. Subsequently, or alternatively, the specific antigen which is the 

target of the immunoglobulin sought, or an epitope thereof, may be immobilized on a column to 
purify the immune specific antibody by immunoaffinity chromatography. Purification of 
immunoglobulins is discussed, for example, by D. Wilkinson (The Scientist, published by The 

8 Scientist, Inc., Philadelphia PA, Vol. 14, No. 8 (April 17, 2000), pp. 25-28). 

5.13.2 Monoclonal Antibodies 

The term "monoclonal antibody" (MAb) or "monoclonal antibody composition", as used 

12 herein, refers to a population of antibody molecules that contain only one molecular species of 
antibody molecule consisting of a unique light chain gene product and a unique heavy chain 
gene product. In particular, the complementarity determining regions (CDRs) of the monoclonal 
antibody are identical in all the molecules of the population. MAbs thus contain an antigen 

16 binding site capable of immunoreacting with a particular epitope of the antigen characterized by 
a unique binding affinity for it. 

Monoclonal antibodies can be prepared using hybridoma methods, such as those 
described by Kohler and Milstein, Nature . 256 :495 (1975). In a hybridoma method, a mouse, 

20 hamster, or other appropriate host animal, is typically immunized with an immunizing agent to 
elicit lymphocytes that produce or are capable of producing antibodies that will specifically bind 
to the immunizing agent. Alternatively, the lymphocytes can be immunized in vitro. 

The immunizing agent will typically include the protein antigen, a fragment thereof or a 

24 fusion protein thereof. Generally, either peripheral blood lymphocytes are used if cells of human 
origin are desired, or spleen cells or lymph node cells are used if non-human mammalian sources 
are desired. The lymphocytes are then fused with an immortalized cell line using a suitable 
fusing agent, such as polyethylene glycol, to form a hybridoma cell (Goding, Monoclonal 

28 Antibodies: Principles and Practice . Academic Press, (1986) pp. 59-103). Immortalized cell 

lines are usually transformed mammalian cells, particularly myeloma cells of rodent, bovine and 
human origin. Usually, rat or mouse myeloma cell lines are employed. The hybridoma cells can 
be cultured in a suitable culture medium that preferably contains one or more substances that 

32 inhibit the growth or survival of the unfused, immortalized cells. For example, if the parental 
cells lack the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the 
culture medium for the hybridomas typically will include hypoxanthine, aminopterin, and 
thymidine ("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 
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Preferred immortalized cell lines are those that fuse efficiently, support stable high level 
expression of antibody by the selected antibody-producing cells, and are sensitive to a medium 
such as HAT medium. More preferred immortalized cell lines are murine myeloma lines, which 

4 can be obtained, for instance, from the Salk Institute Cell Distribution Center, San Diego, 

California and the American Type Culture Collection, Manassas, Virginia. Human myeloma and 
mouse-human heteromyeloma cell lines also have been described for the production of human 
monoclonal antibodies (Kozbor, J. Immunol- 133 :3001 (1984); Brodeur et aL, Monoclonal 

8 Antibody Production Techniques and Applications . Marcel Dekker, Inc., New York, (1 987) pp. 
51-63). 

The culture medium in which the hybridoma cells are cultured can then be assayed for 
the presence of monoclonal antibodies directed against the antigen. Preferably, the binding 

12 specificity of monoclonal antibodies produced by the hybridoma cells is determined by 
immunoprecipitation or by an in vitro binding assay, such as radioimmunoassay (RIA) or 
enzyme-linked immunoabsorbent assay (ELISA). Such techniques and assays are known in the 
art. The binding affinity of the monoclonal antibody can, for example, be determined by the 

1 6 Scatchard analysis of Munson and Pollard, Anal. Biochem. . 107 :220 (1980). Preferably, 

antibodies having a high degree of specificity and a high binding affinity for the target antigen 
are isolated. 

After the desired hybridoma cells are identified, the clones can be subcloned by limiting 

20 dilution procedures and grown by standard methods. Suitable culture media for this purpose 
include, for example, Dulbecco's Modified Eagle's Medium and RPMI-1640 medium. 
Alternatively, the hybridoma cells can be grown in vivo as ascites in a mammal. 

The monoclonal antibodies secreted by the subclones can be isolated or purified from the 

24 culture medium or ascites fluid by conventional immunoglobulin purification procedures such 
as, for example, protein A-Sepharose, hydroxylapatite chromatography, gel electrophoresis, 
dialysis, or affinity chromatography. 

The monoclonal antibodies can also be made by recombinant DNA methods, such as 

28 those described in U.S. Patent No. 4,816,567. DNA encoding the monoclonal antibodies of the 
invention can be readily isolated and sequenced using conventional procedures (e.g., by using 
oligonucleotide probes that are capable of binding specifically to genes encoding the heavy and 
light chains of murine antibodies). The hybridoma cells of the invention serve as a preferred 

32 source of such DNA. Once isolated, the DNA can be placed into expression vectors, which are 
then transfected into host cells such as simian COS cells, Chinese hamster ovary (CHO) cells, or 
myeloma cells that do not otherwise produce immunoglobulin protein, to obtain the synthesis of 
monoclonal antibodies in the recombinant host cells. The DNA also can be modified, for 



77 



WO 01/64835 



PCT/US01/04927 



example, by substituting the coding sequence for human heavy and light chain constant domains 
in place of the homologous murine sequences (U.S. Patent No. 4,816,567; Morrison, Nature 368 , 
812-13 (1994)) or by covalently joining to the immunoglobulin coding sequence all or part of the 
4 coding sequence for a non-immunoglobulin polypeptide. Such a non-immunoglobulin 

polypeptide can be substituted for the constant domains of an antibody of the invention, or can 
be substituted for the variable domains of one antigen-combining site of an antibody of the 
invention to create a chimeric bivalent antibody. 

8 

5.13,2 Humanized Antibodies 

The antibodies directed against the protein antigens of the invention can further comprise 
humanized antibodies or human antibodies. These antibodies are suitable for administration to 

1 2 humans without engendering an immune response by the human against the administered 
immunoglobulin. Humanized forms of antibodies are chimeric immunoglobulins, 
immunoglobulin chains or fragments thereof (such as Fv 3 Fab, Fab', F(ab')2 or other antigen- 
binding subsequences of antibodies) that are principally comprised of the sequence of a human 

16 immunoglobulin, and contain minimal sequence derived from a non-human immunoglobulin. 
Humanization can be performed following the method of Winter and co-workers (Jones et al., 
Nature , 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et al., 
Science , 239:1534-1536 (1988)), by substituting rodent CDRs or CDR sequences for the 

20 corresponding sequences of a human antibody. (See also U.S. Patent No. 5,225,539.) In some 
instances, Fv framework residues of the human immunoglobulin are replaced by corresponding 
non-human residues. Humanized antibodies can also comprise residues which are found neither 
in the recipient antibody nor in the imported CDR or framework sequences. In general, the 

24 humaiiized antibody will comprise substantially all of at least one, and typically two, variable 

domains, in which all or substantially all of the CDR regions correspond to those of a non-human 
immunoglobulin and all or substantially all of the framework regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise at 

28 least a portion of an immunoglobulin constant region (Fc) 5 typically that of a human 

immunoglobulin (Jones et al., 1986; Riechmann et al., 1988; and Presta, Curr. Op. Struct. Biol. . 
2:593-596 (1992)). 

32 5.133 Human Antibodies 

Fully human antibodies relate to antibody molecules in which essentially the entire 
sequences of both the light chain and the heavy chain, including the CDRs, arise from human 
genes. Such antibodies are termed "human antibodies", or "fully human antibodies" herein. 
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Human monoclonal antibodies can be prepared by the trioma technique; the human B-cell 
hybridoma technique (see Kozbor, et aL, 1983 Immunol Today 4: 72) and the EBV hybridoma 
technique to produce human monoclonal antibodies (see Cole, et aL, 1985 In: Monoclonal 
4 Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). Human monoclonal 

antibodies may be utilized in the practice of the present invention and may be produced by using 
human hybridomas (see Cote, et aL, 1983. Proc Natl Acad Sci USA 80: 2026-2030) or by 
transforming human B-cells with Epstein Barr Virus in vitro (see Cole, et aL, 1985 In: 
8 Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). 

In addition, human antibodies can also be produced using additional techniques, 
including phage display libraries (Hoogenboom and Winter, J. Mol. Biol. , 227 :381 (1991); 
Marks et aL, J. Mol. BioL . 222 :581 (1991)). Similarly, human antibodies can be made by 

12 introducing human immunoglobulin loci into transgenic animals, e.g. , mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 
challenge, human antibody production is observed, which closely resembles that seen in humans 
in all respects, including gene rearrangement, assembly, and antibody repertoire. This approach 

16 is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 5,569,825; 5,625,126; 

5,633,425; 5,661,016, and in Marks et al. (Bio/Technology 10, 779-783 (1992)); Lonberg et aL 
(Nature 368 856-859 (1994)); Morrison ( Nature 368, 812-13 (1994)); Fishwild et al,( Nature 
Biotechnology 14. 845-51 (1996)); Neuberger (Nature Biotechnology 14, 826 (1996)); and 

20 Lonberg and Huszar (Intern. Rev. Immunol. 13 65-93 (1 995)). 

Human antibodies may additionally be produced using transgenic nonhuman animals 
which are modified so as to produce fully human antibodies rather than the animal's endogenous 
antibodies in response to challenge by an antigen. (See PCT publication WO94/02602). The 

24 endogenous genes encoding the heavy and light immunoglobulin chains in the nonhuman host 
have been incapacitated, and active loci encoding human heavy and light chain immunoglobulins 
are inserted into the host's genome. The human genes are incorporated, for example, using yeast 
artificial chromosomes containing the requisite human DNA segments. An animal which 

28 provides all the desired modifications is then obtained as progeny by crossbreeding intermediate 
transgenic animals containing fewer than the full complement of the modifications. The 
preferred embodiment of such a nonhuman animal is a mouse, and is termed the Xenomouse™ 
as disclosed in PCT publications WO 96/33735 and WO 96/34096. This animal produces B 

32 cells which secrete fully human immunoglobulins. The antibodies can be obtained directly from 
the animal after immunization with an immunogen of interest, as, for example, a preparation of a 
polyclonal antibody, or alternatively from immortalized B cells derived from the animal, such as 
hybridomas producing monoclonal antibodies. Additionally, the genes encoding the 
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immunoglobulins with human variable regions can be recovered and expressed to obtain the 

antibodies directly, or can be further modified to obtain analogs of antibodies such as, for 

example, single chain Fv molecules. 
4 An example of a method of producing a nonhuman host, exemplified as a mouse, lacking 

expression of an endogenous immunoglobulin heavy chain is disclosed in U.S. Patent No. 

5,939,598. It can be obtained by a method including deleting the J segment genes from at least 

one endogenous heavy chain locus in an embryonic stem cell to prevent rearrangement of the 
8 locus and to prevent formation of a transcript of a rearranged immunoglobulin heavy' chain locus, 

the deletion being effected by a targeting vector containing a gene encoding a selectable marker; 

and producing from the embryonic stem cell a transgenic mouse whose somatic and germ cells 

contain the gene encoding the selectable marker. 
12 A method for producing an antibody of interest, such as a human antibody, is disclosed in 

U.S. Patent No. 5,916^771 . It includes introducing an expression vector that contains a 

nucleotide sequence encoding a heavy chain into one mammalian host cell in culture, introducing 

an expression vector containing a nucleotide sequence encoding a light chain into another 
16 mammalian host cell, and fusing the two cells to form a hybrid cell. The hybrid cell expresses an 

antibody containing the heavy chain and the light chain. 

In a further improvement on this procedure, a method for identifying a clinically relevant 

epitope on an immunogen, and a correlative method for selecting an antibody that binds 
20 immunospecifically to the relevant epitope with high affinity, are disclosed in PCT publication 

WO 99/53049. 

5.13.4 F a b Fragments and Single Chain Antibodies 

24 According to the invention, techniques can be adapted for the production of single-chain 

antibodies specific to an antigenic protein of the invention (see e.g., U.S. Patent No. 4,946,778). 
In addition, methods can be adapted for the construction of F a b expression libraries (see e.g., 
Huse, et al. 3 1989 Science 246: 1275-1281) to allow rapid and effective identification of 

28 monoclonal F a b fragments with the desired specificity for a protein or derivatives, fragments, 

analogs or homologs thereof. Antibody fragments that contain the idiotypes to a protein antigen 
may be produced by techniques known in the art including, but not limited to: (i) an F( ab -)2 
fragment produced by pepsin digestion of an antibody molecule; (ii) an F a b fragment generated 

32 by reducing the disulfide bridges of an F( a b')2 fragment; (iii) an F a b fragment generated by the 
treatment of the antibody molecule with papain and a reducing agent and (iv) F v fragments. 

5.13-5 Bispecific Antibodies 
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Bispecific antibodies are monoclonal, preferably human or humanized, antibodies that 
have binding specificities for at least two different antigens. In the present case, one of the 
binding specificities is for an antigenic protein of the invention. The second binding target is any 
4 other antigen, and advantageously is a cell-surface protein or receptor or receptor subunit. 

Methods for making bispecific antibodies are known in the art. Traditionally, the 
recombinant production of bispecific antibodies is based on the co-expression of two 
immunoglobulin heavy-chain/light-chain pairs, where the two heavy chains have different 
8 specificities (Milstein and Cuello, Nature. 305:537-539 (1983)). Because of the random 

assortment of immunoglobulin heavy and light chains, these hybridomas (quadromas) produce a 
potential mixture of ten different antibody molecules, of which only one has the correct 
bispecific structure. The purification of the correct molecule is usually accomplished by affinity 

12 chromatography steps. Similar procedures are disclosed in WO 93/08829, published 13 May 
1993, and inTraunecker et al, 1991 EMBOJ., 10:3655-3659. 

Antibody variable domains with the desired binding specificities (antibody-antigen 
combining sites) can be fused to immunoglobulin constant domain sequences. The fusion 

16 preferably is with an immunoglobulin heavy-chain constant domain, comprising at least part of 
the hinge, CH2, and CH3 regions. It is preferred to have the first heavy-chain constant region 
(CHI) containing the site necessary for light-chain binding present in at least one of the fusions. 
DNAs encoding the immunoglobulin heavy-chain fusions and, if desired, the immunoglobulin 

20 light chain, are inserted into separate expression vectors, and are co-transfected into a suitable 
host organism. For further details of generating bispecific antibodies see, for example, Suresh et 
al., Methods in Enzymology , 121:210 (1986). 

According to another approach described in WO 96/2701 1, the interface between a pair 

24 of antibody molecules can be engineered to maximize the percentage of heterodimers which are 
recovered from recombinant cell culture. The preferred interface comprises at least a part of the 
CH3 region of an antibody constant domain. In this method, one or more small amino acid side 
chains from the interface of the first antibody molecule are replaced with larger side chains (e.g. 

28 tyrosine or tryptophan). Compensatory "cavities" of identical or similar size to the large side 
chain(s) are created on the interface of the second antibody molecule by replacing large amino 
acid side chains with smaller ones (e.g. alanine or threonine). This provides a mechanism for 
increasing the yield of the heterodimer over other unwanted end-products such as homodimers. 

32 Bispecific antibodies can be prepared as full length antibodies or antibody fragments (e.g. 

F(ab') 2 bispecific antibodies). Techniques for generating bispecific antibodies from antibody 
fragments have been described in the literature. For example, bispecific antibodies can be 
prepared using chemical linkage. Brennan et al., Science 229:81 (1985) describe a procedure 
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wherein intact antibodies are proteolytically cleaved to generate F(ab') 2 fragments. These 
fragments are reduced in the presence of the dithiol complexing agent sodium arsenite to 
stabilize vicinal dithiols and prevent intermolecular disulfide formation. The Fab' fragments 
4 generated are then converted to thionitrobenzoate (TNB) derivatives. One of the Fab'-TNB 
derivatives is then reconverted to the Fab' -thiol by reduction with mercaptoethylamine and is 
mixed with an equimolar amount of the other Fab'-TNB derivative to form the bispecific 
antibody. The bispecific antibodies produced can be used as agents for the selective 
8 immobilization of enzymes. 

Additionally, Fab' fragments can be directly recovered from E. coli and chemically 
coupled to form bispecific antibodies. Shalaby et al., J. Exp. Med. 175:217-225 (1992) describe 
the production of a fully humanized bispecific antibody F(ab') 2 molecule. Each Fab' fragment 

12 was separately secreted from E. coli and subjected to directed chemical coupling in vitro to form 
the bispecific antibody. The bispecific antibody thus formed was able to bind to cells 
overexpressing the ErbB2 receptor and normal human T cells, as well as trigger the lytic activity 
of human cytotoxic lymphocytes against human breast tumor targets. 

16 Various techniques for making and isolating bispecific antibody fragments directly from 

recombinant cell culture have also been described. For example, bispecific antibodies have been 
produced using leucine zippers. Kostelny et al., J. Immunol. 148(5):1547-1553 (1992). The 
leucine zipper peptides from the Fos and Jun proteins were linked to the Fab' portions of two 

20 different antibodies by gene fusion. The antibody homodimers were reduced at the hinge region 
to form monomers and then re-oxidized to form the antibody heterodimers. This method can 
also be utilized for the production of antibody homodimers. The "diabody" technology 
described by Hollinger et al., Proc. Natl. Acad. Sci. USA 90:6444-6448 (1993) has provided an 

24 alternative mechanism for making bispecific antibody fragments. The fragments comprise a 
heavy-chain variable domain (Vh) connected to a light-chain variable domain (V L ) by a linker 
which is too short to allow pairing between the two domains on the same chain. Accordingly, 
the Vh and Vl domains of one fragment are forced to pair with the complementary V L and Vh 

28 domains of another fragment, thereby forming two antigen-binding sites. Another strategy for 
making bispecific antibody fragments by the use of single-chain Fv (sFv) dimers has also been 
reported. See, Gruber et al., J. Immunol. 152:5368 (1994). 

Antibodies with more than two valencies are contemplated. For example, trispecific 

32 antibodies can be prepared. Tutt et al., J. Immunol. 147:60 (1 991). 

Exemplary bispecific antibodies can bind to two different epitopes, at least one of which 
originates in the protein antigen of the invention. Alternatively, an anti-antigenic arm of an 
immunoglobulin molecule can be combined with an arm which binds to a triggering molecule on 
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a leukocyte such as a T~cell receptor molecule (e.g. CD2, CD3, CD28, or B7), or Fc receptors for 
IgG(Fc R), suchasFc RI (CD64), Fc RII (CD32) and Fc RIII (CD16) so as to focus 
cellular defense mechanisms to the cell expressing the particular antigen. Bispecific antibodies 
4 can also be used to direct cytotoxic agents to cells which express a particular antigen. These 
antibodies possess an antigen-binding arm and an arm which binds a cytotoxic agent or a 
radionuclide chelator, such as EOTUBE, DPTA, DOTA, or TETA. Another bispecific antibody 
of interest binds the protein antigen described herein and further binds tissue factor (TF). 

8 

5.13.6 Heteroconjugate Antibodies 

Heteroconjugate antibodies are also within the scope of the present invention. 
Heteroconjugate antibodies are composed of two covalently joined antibodies. Such antibodies 

12 have, for example, been proposed to target immune system cells to unwanted cells (U.S. Patent 
No. 4,676,980), and for treatment of HIV infection (WO 91/00360; WO 92/200373; EP 03089). 
It is contemplated that the antibodies can be prepared in vitro using known methods in synthetic 
protein chemistry, including those involving crosslinking agents. For example, immunotoxins 

1 6 can be constructed using a disulfide exchange reaction or by forming a thioether bond. 
Examples of suitable reagents for this purpose include iminothiolate and methyl-4- 
mercaptobutyrimidate and those disclosed,* for example, in U.S. Patent No. 4,676,980. 

20 5.13.7 Effector Function Engineering 

It can be desirable to modify the antibody of the invention with respect to effector function, so as 
to enhance, e.g., the effectiveness of the antibody in treating cancer. For example, cysteine 
residue(s) can be introduced into the Fc region, thereby allowing interchain disulfide bond 

24 formation in this region. The homodimeric antibody thus generated can have improved 

internalization capability and/or increased complement-mediated cell killing and antibody- 
dependent cellular cytotoxicity (ADCC). See Caron et al. 3 J. Exp Med., 176: 1191-1195 (1992) 
and Shopes, J. Immunol., 148: 2918-2922 (1992). Homodimeric antibodies with enhanced anti- 

28 tumor activity can also be prepared using heterobifunctional cross-linkers as described in Wolff 
et aL Cancer Research, 53: 2560-2565 (1993). Alternatively, an antibody can be engineered that 
has dual Fc regions and can thereby have enhanced complement lysis and ADCC capabilities. 
See Stevenson et al., Anti-Cancer Drug Design, 3: 219-230 (1989). 

32 

5.13*8 Immunoconjugates 

The invention also pertains to immunoconjugates comprising an antibody conjugated to a 
cytotoxic agent such as a chemotherapeutic agent, toxin (e.g., an enzymatically active toxin of 
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bacterial, fungal, plant, or animal origin, or fragments thereof), or a radioactive isotope (i.e., a 
radioconjugate). 

Chemotherapeutic agents useful in the generation of such immunoconjugate.s have been 
4 described above. Enzymatically active toxins and fragments thereof that can be used include 
diphtheria A chain, nonbinding active fragments of diphtheria toxin, exotoxin A chain (from 
Pseudomonas aeruginosa), ricin A chain, abrin A chain, modeccin A chain, alpha-sarcin, 
Aleurites fordii proteins, dianthin proteins, Phytolaca americana proteins (PAPI, PAPII, and 
8 PAP-S), momordica charantia inhibitor, curcin, crotin, sapaonaria officinalis inhibitor, gelonin, 
mitogellin, restrictocin, phenomycin, enomycin, and the tricothecenes. A variety of 
radionuclides are available for the production of radioconjugated antibodies. Examples include 
212 Bi, I31 1, 131 In, 9 °Y,and 186 Re. 

12 Conjugates of the antibody and cytotoxic agent are made using a variety of bifunctional 

protein-coupling agents such as N-succinimidyl-3-(2-pyridyldithiol) propionate (SPDP), 
iminothiolane (IT), bifunctional derivatives of imidoesters (such as dimethyl adipimidate HCL), 
active esters (such as disuccinimidyl suberate), aldehydes (such as glutareldehyde), bis-azido 

16 compounds (such as bis (p-azidobenzoyl) hexanediamine), bis-diazonium derivatives (such as 
bis-(p-diazoniumbenzoyl)-ethylenediamine), diisocyanates (such as tolyene 2,6-diisocyanate), 
and bis-active fluorine compounds (such as l,5-difluoro-2,4-dinitrobenzene). For example, a 
ricin immunotoxin can be prepared as described in Vitetta et al., Science, 238: 1098 (1987). 

20 Carbon- 14-labeled l-isothiocyanatobenzyl-3-methyldiethylene triaminepentaacetic acid (MX- 
DTPA) is an exemplary chelating agent for conjugation of radionucleotide to the antibody. See 
WO94/11026. 

In another embodiment, the antibody can be conjugated to a "receptor" (such 
24 streptavidin) for utilization in tumor pretargeting wherein the antibody-receptor conjugate is 
administered to the patient, followed by removal of unbound conjugate from the circulation 
using a clearing agent and then administration of a "ligand" (e.g., avidin) that is in turn 
conjugated to a cytotoxic agent. 

28 

4.14 COMPUTER READABLE SEQUENCES 

In one application of this embodiment, a nucleotide sequence of the present invention can 
be recorded on computer readable media. As used herein, "computer readable media" refers to 
32 any medium which can be read and accessed directly by a computer. Such media include, but 
are not limited to: magnetic storage media, such as floppy discs, hard disc storage medium, and 
magnetic tape; optical storage media such as CD-ROM; electrical storage media such as RAM 
and ROM; and hybrids of these categories such as magnetic/optical storage media. A skilled 
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artisan can readily appreciate how any of the presently known computer readable mediums can 
be used to create a manufacture comprising computer readable medium having recorded thereon 
a nucleotide sequence of the present invention. As used herein, "recorded" refers to a process for 
4 storing information on computer readable medium. A skilled artisan can readily adopt any of the 
presently known methods for recording information on computer readable medium to generate 
manufactures comprising the nucleotide sequence information of the present invention. 

A variety of data storage structures are available to a skilled artisan for creating a 
8 computer readable medium having recorded thereon a nucleotide sequence of the present 

invention. The choice of the data storage structure will generally be based on the means chosen 
to access the stored information. In addition, a variety of data processor programs and formats 
can be used to store the nucleotide sequence information of the present invention on computer 

12 readable medium. The sequence information can be represented in a word processing text file, 
formatted in commercially-available software such as WordPerfect and Microsoft Word, or 
represented in the form of an ASCII file, stored in a database application, such as DB2, Sybase, 
Oracle, or the like. A skilled artisan can readily adapt any number of data processor structuring 

16 formats (e.g. text file or database) in order to obtain computer readable medium having recorded 
thereon the nucleotide sequence information of the present invention. 

By providing any of the nucleotide sequences SEQ ID NO: 1-13901 or a representative 
fragment thereof; or a nucleotide sequence at least 95% identical to any of the nucleotide 

20 sequences of SEQ ID NO: 1-13901 in computer readable form, a skilled artisan can routinely 
access the sequence information for a variety of purposes. Computer software is publicly 
available which allows a skilled artisan to access sequence information provided in a computer 
readable medium. The examples which follow demonstrate how software which implements the 

24 BLAST (Altschul et al., J. Mol. Biol. 215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. 

Chem. 17:203-207 (1993)) search algorithms on a Sybase system is used to identify open reading 
frames (ORFs) within a nucleic acid sequence. Such ORFs may be protein encoding fragments 
and may be useful in producing commercially important proteins such as enzymes used in 

28 fermentation reactions and in the production of commercially useful metabolites. 

As used herein, "a computer-based system" refers to the hardware means, software 
means, and data storage means used to analyze the nucleotide sequence information of the 
present invention. The minimum hardware means of the computer-based systems of the present 

32 invention comprises a central processing unit (CPU), input means, output means, and data 

storage means. A skilled artisan can readily appreciate that any one of the currently available 
computer-based systems are suitable for use in the present invention. As stated above, the 
computer-based systems of the present invention comprise a data storage means having stored 
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therein a nucleotide sequence of the present invention and the necessary hardware means and 
software means for supporting and implementing a search means. As used herein, "data storage 
means' 1 refers to memory which can store nucleotide sequence information of the present 
4 invention, or a memory access means which can access manufactures having recorded thereon 
the nucleotide sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which are implemented 
on the computer-based system to compare a target sequence or target structural motif with the 
8 sequence information stored within the data storage means. Search means are used to identify 
fragments or regions of a known sequence which match a particular target sequence or target 
motif. A variety of known algorithms are disclosed publicly and a variety of commercially 
available software for conducting search means are and can be used in the computer-based 

12 systems of the present invention. Examples of such software includes, but is not limited to, 
Smith-Waterman, MacPattern (EMBL), BLASTN and BLASTA (NPOLYPEPTIDEI A) . A 
skilled artisan can readily recognize that any one of the available algorithms or implementing 
software packages for conducting homology searches can be adapted for use in the present 

16 computer-based systems. As used herein, a "target sequence" can be any nucleic acid or amino 
acid sequence of six or more nucleotides or two or more amino acids. A skilled artisan can 
readily recognize that the longer a target sequence is, the less likely a target sequence will be 
present as a random occurrence in the database. The most preferred sequence length of a target 

20 sequence is from about 10 to 300 amino acids, more preferably from about 30 to 100 nucleotide 
residues. However, it is well recognized that searches for commercially important fragments, 
such as sequence fragments involved in gene expression and protein processing, may be of 
shorter length. 

24 As used herein, "a target structural motif," or "target motif," refers to any rationally 

selected sequence or combination of sequences in which the sequence(s) are chosen based on a 
three-dimensional configuration which is formed upon the folding of the target motif. There are 
a variety of target motifs known in the art. Protein target motifs include, but are not limited to, 

28 enzyme active sites and signal sequences. Nucleic acid target motifs include, but are not limited 
to, promoter sequences, hairpin structures and inducible expression elements (protein binding 
sequences). 

32 4.15 TRIPLE HELIX FORMATION 

In addition, the fragments of the present invention, as broadly described, can be used to 
control gene expression through triple helix formation or antisense DNA or RNA, both of which 
methods are based on the binding of a polynucleotide sequence to DNA or RNA. 



86 



WO 01/64835 PCT/US01/04927 

Polynucleotides suitable for use in these methods are preferably 20 to 40 bases in length and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - see 
Lee et aL, Nucl. Acids Res. 6:3073 (1979); Cooney et aL, Science 15241:456 (1988); and Dervan 

4 et aL, Science 251:1360 (1991)) or to the mRNA itself (antisense - Olmno, J. Neurochem. 

56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, 
Boca Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA 
transcription from DNA, while antisense RNA hybridization blocks translation of an mRNA 

8 molecule into polypeptide. Both techniques have been demonstrated to be effective in model 
systems. Information contained in the sequences of the present invention is necessary for the 
design of an antisense or triple helix oligonucleotide. 

12 4.16 DIAGNOSTIC ASSAYS AND KITS 

The present invention further provides methods to identify the presence or expression of 
one of the ORFs of the present invention, or homolog thereof, in a test sample, using a nucleic 
acid probe or antibodies of the present invention, optionally conjugated or otherwise associated 

1 6 with a suitable label . 

In general, methods for detecting a polynucleotide of the invention can comprise 
contacting a sample with a compound that binds to and forms a complex with the polynucleotide 
for a period sufficient to form the complex, and detecting the complex, so that if a complex is 

20 detected, a polynucleotide of the invention is detected in the sample. Such methods can also 

comprise contacting a sample under stringent hybridization conditions with nucleic acid primers 
that anneal to a polynucleotide of the invention under such conditions, and amplifying annealed 
polynucleotides, so that if a polynucleotide is amplified, a polynucleotide of the invention is 

24 detected in the sample. 

In general, methods for detecting a polypeptide of the invention can comprise contacting 
a sample with a compound that binds to and forms a complex with the polypeptide for a period 
sufficient to form the complex, and detecting the complex, so that if a complex is detected, a 

28 polypeptide of the invention is detected in the sample. 

In detail, such methods comprise incubating a test sample with one or more of the 
antibodies or one or more of the nucleic acid probes of the present invention and assaying for 
binding of the nucleic acid probes or antibodies to components within the test sample. 

32 Conditions for incubating a nucleic acid probe or antibody with a test sample vary. 

Incubation conditions depend on the format employed in the assay, the detection methods 
employed, and the type and nature of the nucleic acid probe or antibody used in the assay. One 
skilled in the art will recognize that any one of the commonly available hybridization, 
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amplification or immunological assay formats can readily be adapted to employ the nucleic acid 
probes or antibodies of the present invention. Examples of such assays can be found in Chard, 
T., An Introduction to Radioimmunoassay and Related Techniques, Elsevier Science Publishers, 
4 Amsterdam, The Netherlands (1986); Bullock, G.R. et al., Techniques in Immunocytochemistry, 
Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice 
and Theory of immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, 
Elsevier Science Publishers, Amsterdam, The Netherlands (1985). The test samples of the 
8 present invention include cells, protein or membrane extracts of cells, or biological fluids such as 
sputum, blood, serum, plasma, or urine. The test sample used in the above-described method 
will vary based on the assay format, nature of the detection method and the tissues, cells or 
extracts used as the sample to be assayed. Methods for preparing protein extracts or membrane 

12 extracts of cells are well known in the art and can be readily be adapted in order to obtain a 
sample which is compatible with the system utilized. 

In another embodiment of the present invention, kits are provided which contain the 
necessary reagents to carry out the assays of the present invention. Specifically, the invention 

16 provides a compartment kit to receive, in close confinement, one or more containers which 
comprises: (a) a first container comprising one of the probes or antibodies of the present 
invention; and (b) one or more other containers comprising one or more of the following: wash 
reagents, reagents capable of detecting presence of a bound probe or antibody. 

20 In detail, a compartment kit includes any kit in which reagents are contained in separate 

containers. Such containers include small glass containers, plastic containers or strips of plastic 
or paper. Such containers allows one to efficiently transfer reagents from one compartment to 
another compartment such that the samples and reagents are not cross-contaminated, and the 

24 agents or solutions of each container can be added in a quantitative fashion from one 

compartment to another. Such containers will include a container which will accept the test 
sample, a container which contains the antibodies used in the assay, containers which contain 
wash reagents (such as phosphate buffered saline, Tris-buffers, etc.), and containers which 

28 contain the reagents used to detect the bound antibody or probe. Types of detection reagents 
include labeled nucleic acid probes, labeled secondary antibodies, or in the alternative, if the 
primary antibody is labeled, the enzymatic, or antibody binding reagents which are capable of 
reacting with the labeled antibody. One skilled in the art will readily recognize that the disclosed 

32 probes and antibodies of the present invention can be readily incorporated into one of the 
established kit formats which are well known in the art. 

4.17 MEDICAL IMAGING 
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The novel polypeptides and binding partners of the invention are useful in medical 
imaging of sites expressing the molecules of the invention (e.g. y where the polypeptide of the 
invention is involved in the immune response, for imaging sites of inflammation or infection). 
4 See, e.g., Kunkel et al., U.S. Pat. NO. 5,413,778. Such methods involve chemical attachment of 
a labeling or imaging agent, administration of the labeled polypeptide to a subject in a 
pharmaceutically acceptable carrier, and imaging the labeled polypeptide in vivo at the target 
site. 

8 

4.18 SCREENING ASSAYS 

Using the isolated proteins and polynucleotides of the invention, the present invention 
further provides methods of obtaining and identifying agents which bind to a polypeptide 
12 encoded by an ORF corresponding to any of the nucleotide sequences set forth in SEQ ID NO: 
1-13901 , or bind to a specific domain of the polypeptide encoded by the nucleic acid. In detail, 
said method comprises the steps of: 

(a) contacting an agent with an isolated protein encoded by an ORF of the present 
16 invention, or nucleic acid of the invention; and 

(b) determining whether the agent binds to said protein or said nucleic acid. 
In general, therefore, such methods for identifying compounds that bind to a 

polynucleotide of the invention can comprise contacting a compound with a polynucleotide of 
20 the invention for a time sufficient to form a polynucleotide/compound complex, and detecting 

the complex, so that if a polynucleotide/compound complex is detected, a compound that binds 

to a polynucleotide of the invention is identified. 

Likewise, in general, therefore, such methods for identifying compounds that bind to a 
24 polypeptide of the invention can comprise contacting a compound with a polypeptide of the 

invention for a time sufficient to form a polypeptide/compound complex, and detecting the 

complex, so that if a polypeptide/compound complex is detected, a compound that binds to a 

polynucleotide of the invention is identified. 
28 Methods for identifying compounds that bind to a polypeptide of the invention can also 

comprise contacting a compound with a polypeptide of the invention in a cell for a time 

sufficient to form a polypeptide/compound complex, wherein the complex drives expression of a 

receptor gene sequence in the cell, and detecting the complex by detecting reporter gene 
32 sequence expression, so that if a polypeptide/compound complex is detected, a compound that 

binds a polypeptide of the invention is identified. 

Compounds identified via such methods can include compounds which modulate the 

activity of a polypeptide of the invention (that is, increase or decrease its activity, relative to 
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activity observed in the absence of the compound). Alternatively, compounds identified via such 
methods can include compounds which modulate the expression of a polynucleotide of the 
invention (that is, increase or decrease expression relative to expression levels observed in the 
4 absence of the compound). Compounds, such as compounds identified via the methods of the 
invention, can be tested using standard assays well known to those of skill in the art for their 
ability to modulate activity/expression. 

The agents screened in the above assay can be, but are not limited to, peptides, 
8 carbohydrates, vitamin derivatives, or other pharmaceutical agents. The agents can be selected 
and screened at random or rationally selected or designed using protein modeling techniques. 

For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and 
the like are selected at random and are assayed for their ability to bind to the protein encoded by 

12 the ORF of the present invention. Alternatively, agents may be rationally selected or designed. 
As used herein, an agent is said to be "rationally selected or designed" when the agent is chosen 
based on the configuration of the particular protein. For example, one skilled in the art can 
readily adapt currently available procedures to generate peptides, pharmaceutical agents and the 

16 like, capable of binding to a specific peptide sequence, in order to generate rationally designed 
antipeptide peptides, for example see Hurby et al., Application of Synthetic Peptides: Antisense 
Peptides," In Synthetic Peptides, A User's Guide, W.H. Freeman, NY (1992), pp. 289-307, and 
Kaspczak et aL, Biochemistry 28:9230-8 (1989), or pharmaceutical agents, or the like. 

20 In addition to the foregoing, one class of agents of the present invention, as broadly 

described, can be used to control gene expression through binding to one of the ORFs or EMFs 
of the present invention. As described above, such agents can be randomly screened or 
rationally designed/selected. Targeting the ORF or EMF allows a skilled artisan to design 

24 sequence specific or element specific agents, modulating the expression of either a single ORF or 
multiple ORFs which rely on the same EMF for expression control. One class of DNA binding 
agents are agents which contain base residues which hybridize or form a triple helix formation 
by binding to DNA or RNA. Such agents can be based on the classic phosphodiester, 

28 ribonucleic acid backbone, or can be a variety of sulfhydryl or polymeric derivatives which have 
base attachment capacity. 

Agents suitable for use in these methods preferably contain 20 to 40 bases and are 
designed to be complementary to a region of the gene involved in transcription (triple helix - see 

32 Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney et al., Science 241 :456 (1 988); and Dervan et 
al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano, J. Neurochem. 56:560 
(1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca 
Raton, FL (1988)). Triple helix-formation optimally results in a shut-off of RNA transcription 
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from DNA, while antisense RNA hybridization blocks translation of an mRNA molecule into 
polypeptide. Both techniques have been demonstrated to be effective in model systems. 
Information contained in the sequences of the present invention is necessary for the design of an 

4 antisense or triple helix oligonucleotide and other DNA binding agents. 

Agents which bind to a protein encoded by one of the ORPs of the present invention can 
be used as a diagnostic agent. Agents which bind to a protein encoded by one of the ORFs of the 
present invention can be formulated using known techniques to generate a pharmaceutical 

8 composition. 

4.19 USE OF NUCLEIC ACIDS AS PROBES 

Another aspect of the subject invention is to provide for polypeptide-specific nucleic acid 
12 hybridization probes capable of hybridizing with naturally occurring nucleotide sequences. The 
hybridization probes of the subject invention may be derived from any of the nucleotide 
sequences SEQ ID NOS: 1-13901 . Because the corresponding gene is only expressed in a 
limited number of tissues, a hybridization probe derived from of any of the nucleotide sequences 
16 SEQ ID NO: 1-13901 can be used as an indicator of the presence of RNA of cell type of such a 
tissue in a sample. 

Any suitable hybridization technique can be employed, such as, for example, in situ 
hybridization. PCR as described in US Patents Nos. 4,683,195 and 4,965,1 88 provides 

20 additional uses for oligonucleotides based upon the nucleotide sequences. Such probes used in 
PCR may be of recombinant origin, may be chemically synthesized, or a mixture of both. The 
probe will comprise a discrete nucleotide sequence for the detection of identical sequences or a 
degenerate pool of possible sequences for identification of closely related genomic sequences. 

24 Other means for producing specific hybridization probes for nucleic acids include the 

cloning of nucleic acid sequences into vectors for the production of mRNA probes. Such vectors 
are known in the art and are commercially available and may be used to synthesize RNA probes 
in vitro by means of the addition of the appropriate RNA polymerase as T7 or SP6 RNA 

28 polymerase and the appropriate radioactively labeled nucleotides. The nucleotide sequences may 
be used to construct hybridization probes for mapping their respective genomic sequences. The 
nucleotide sequence provided herein may be mapped to a chromosome or specific regions of a 
chromosome using well known genetic and/or chromosomal mapping techniques. These 

32 techniques include in situ hybridization, linkage analysis against known chromosomal markers, 
hybridization screening with libraries or flow-sorted chromosomal preparations specific to 
known chromosomes, and the like. The technique of fluorescent in situ hybridization of 
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chromosome spreads has been described, among other places, in Verma et al (1988) Human 
Chromosomes: A Manual of Basic Techniques, Pergamon Press, New York NY. 

Fluorescent in situ hybridization of chromosomal preparations and other physical 

4 chromosome mapping techniques may be correlated with additional genetic map data. Examples 
of genetic map data can be found in the 1994 Genome Issue of Science (265: 198 If). Correlation 
between the location of a nucleic acid on a physical chromosomal map and a specific disease (or 
predisposition to a specific disease) may help delimit the region of DNA associated with that 

8 genetic disease. The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier or affected individuals. 

4.20 PREPARATION OF SUPPORT BOUND OLIGONUCLEOTIDES 

Oligonucleotides, i.e. , small nucleic acid segments, may be readily prepared by , for 
1 2 example, directly synthesizing the oligonucleotide by chemical means, as is commonly practiced 

using an automated oligonucleotide synthesizer. 

Support bound oligonucleotides may be prepared by any of the methods known to those of 

skill in the art using any suitable support such as glass, polystyrene or Teflon. One strategy is to 
1 6 precisely spot oligonucleotides synthesized by standard synthesizers , Immobilization can be 

achieved using passive adsorption (Inouye & Hondo, (1 990) J. Clin. Microbiol. 28(6) 1469-72); 

using UV light (Nagata et al, 1985; Dahlen et al } 1987; Morrissey & Collins, (1989) MoL Cell 

Probes 3(2) 1 89-207) or by covalent binding of base modified DNA (Keller et al, 1 988; 1 989); all 
20 references being specifically incorporated herein. 

Another strategy that may be employed is the use of the strong biotin-streptavidin 

interaction as a linker. For example, Broude et al (1994) Proc. Natl. Acad. ScL USA 91(8) 3072-6, . 

describe the use of biotinylated probes, although these are duplex probes, that are immobilized on 
24 streptavi din-coated magnetic beads. Streptavidin-coated beads may be purchased from Dynal, 

Oslo. Of course, this same linking chemistry is applicable to coating any surface with streptavidin. 

Biotinylated probes may be purchased from various sources, such as, e.g. 9 Operon Technologies 

(Alameda, CA). 

28 Nunc Laboratories (Naperville, IL) is also selling suitable material that could be used. Nunc 

Laboratories have developed a method by which DNA can be covalently bound to the microwell 
surface termed Covalink NH. CovaLink NH is a polystyrene surface grafted with secondary amino 
groups (>NH) that serve as bridge-heads for further covalent coupling. CovaLink Modules may be 

32 purchased from Nunc Laboratories. DNA molecules may be bound to CovaLink exclusively at the 
5 -end by a phosphoramidatebond, allowing immobilization of more than 1 pmol of DNA 
(Rasmussen^a/., (1991) Anal. Biochem. 198(1) 138-42). 
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The use of CovaLinkNH strips for covalent binding of DNA molecules at the S'-end has 
been described (Rasmussenet al., (1991). In this technology, a phosphoramidatebond is employed 
(Chu et al., (1983) Nucleic Acids Res. 1 1(8) 6513-29). This is beneficial as immobilization using 
4 only a single covalent bond is preferred. The phosphoramidate bond joins the DNA to the 

CovaLink NH secondary amino groups that are positioned at the end of spacer arms covalently 
grafted onto the polystyrene surface through a 2 nm long spacer arm. To link an oligonucleotide to 
CovaLink NH via an phosphoramidate bond, the oligonucleotide terminus must have a 5 f -end 
8 phosphate group. It is, perhaps, even possible for biotin to be covalently bound to CovaLink and 
then streptavidinused to bind the probes. 

More specifically, the linkage method includes dissolving DNA in water (7.5 ng/ul) and 
denaturing for 10 min. at 95°C and cooling on ice for 10 min. Ice-cold 0.1 M 1-methylimidazole, 
1 2 pH 7.0(1 -Melm 7 ), is then added to a final concentration of 1 0 mM 1 -Melm 7 . A ss DNA solution is 
then dispensed into CovaLink NH strips (75 ul/well) standing on ice. 

Carbodiimide 0.2 M 1 -e1hyl-3-(3-dimethylaminopropyl)-carbodiimide (EDC), dissolved in 
10 mM 1-Melm 7? is made fresh and 25 ul added per well. The strips are incubated for 5 hours at 
1 6 50°C. After incubation the strips are washed using, e.g. , Nunc-Immuno Wash; first the wells are 
washed 3 times, then they are soaked with washing solution for 5 min., and finally they are washed 
3 times (where in the washing solution is 0.4 N NaOH, 0.25% SDS heated to 50°C). 

It is contemplated that a further suitable method for use with the present invention is that 
20 described in PCT Patent Application WO 90/03382 (Southern & Maskos), incorporated herein by 
reference. This method of preparing an oligonucleotide bound to a support involves attaching a 
nucleoside 3'-reagent through the phosphate group by a covalent phosphodiester link to aliphatic 
hydroxyl groups carried by the support. The oligonucleotide is then synthesized on the supported 
24 nucleoside and protecting groups removed from the synthetic oligonucleotide chain under standard 
conditions that do not cleave the oligonucleotide from the support. Suitable reagents include 
nucleoside phosphoramidite and nucleoside hydrogen phosphorate. 

An on-chip strategy for the preparation of DNA probe for the preparation of DNA probe 
28 arrays may be employed. For example, addressable laser-activated photodeprotection may be 

employed in the chemical synthesis of oligonucleotides directly on a glass surface, as described by 
Fodor etal (1991) Science 25 1(4995) 767-73, incorporated herein by reference. Probes may also 
be immobilized on nylon supports as described by Van Ness et al. ( 1 99 1 ) Nucleic Acids Res. 
32 19(12) 3345-50; or linked to Teflon using the method of Duncan & Cavalier (1988) Anal. Biochem. 
169(1) 104-8; all references being specifically incorporated herein. 
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To link an oligonucleotide to a nylon support, as described by Van Ness etal (1991), 
requires activation of the nylon surface via alkylation and selective activation of the 5~amine of 
oligonucleotides with cyanuric chloride, 

4 One particular way to prepare support bound oligonucleotides is to utilize the 

light-generated synthesis described by Pease et al, (1994) PNAS USA 91(1 1) 5022-6, incorporated 
herein by reference). These authors used current photolithographic techniques to generate arrays of 
immobilized oligonucleotide probes (DNA chips). These methods, in which light is used to direct 

8 the synthesis of oligonucleotide probes in high-density, miniaturized arrays, utilize photolabile 
5 -protected jV-acyl-deoxynucleosidephosphoramidites, surface linker chemistry and versatile 
combinatorial synthesis strategies. A matrix of 256 spatially defined oligonucleotide probes may be 
generated in this manner. 

12 4.21 PREPARATION OF NUCLEIC ACID FRAGMENTS 

The nucleic acids may be obtained from any appropriate source, such as cDNAs, genomic 
DNA, chromosomal DNA, microdissected chromosome bands, cosmid or YAC inserts, and RNA, 
including mRNA without any amplification steps. For example, Sambrook et al ( 1 989) describes 
1 6 three protocols for the isolation of high molecular weight DNA from mammalian cells (p. 
9.14-9.23). 

DNA fragments may be prepared as clones in M 1 3 , plasmid or lambda vectors and/or 
prepared directly from genomic DNA or cDNA by PCR or other amplificationmethods. Samples 
20 may be prepared or dispensed in multiwell plates. About 100-1000 ng of DNA samples may be 
prepared in 2-500 ml of final volume. 

The nucleic acids would then be fragmented by any of the methods known to those of skill 
in the art including, for example, using restriction enzymes as described at 9.24-9.28 of Sambrook et 
24 al (1989), shearing by ultrasound and NaOH treatment. 

Low pressure shearing is also appropriate, as described by Schriefer et al (1 990) Nucleic 
Acids Res. 1 8(24) 7455-6, incorporated herein by reference). In this method, DNA samples are 
passed through a small French pressure cell at a variety of low to intermediate pressures. A lever 
28 device allows controlled application of low to intermediate pressures to the cell. The results of 

these studies indicate that low-pressure shearing is a useful alternative to sonic and enzymatic DNA 
fragmentation methods. 

One particularly suitable way for fragmenting DNA is contemplated to be that using the two 
32 base recognition endonuclease, Cv/JI, described by Fitzgerald et al (1 992) Nucleic Acids Res. 

20(14) 3753-62. These authors described an approach for the rapid fragmentation and fractionation 
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of DNA into particular sizes that they contemplated to be suitable for shotgun cloning and 
sequencing. 

The restriction endonuclease CviJl normally cleaves the recognition sequence PuGCPy 
4 between the G and C to leave blunt ends. Atypical reaction conditions, which alter the specificity of 
this enzyme (Cvi JI* *), yield a quasi-random distribution of DNA fragments form the small 
molecule pUC 1 9 (2688 base pairs). Fitzgerald et al (1 992) quantitatively evaluated the 
randomness of this fragmentation strategy, using a CvzJI** digest of pUCl 9 that was size 
8 fractionated by a rapid gel filtration method and directly ligated, without end repair, to a lac Z minus 
Ml 3 cloning vector. Sequence analysis of 76 clones showed that CvzJI** restricts pyGCPy and 
PuGCPu, in addition to PuGCPy sites, and that new sequence data is accumulated at a rate 
consistent with random fragmentation. 

12 As reported in the literature, advantages of this approach compared to sonication and 

agarose gel fractionation include: smaller amounts of DNA are required (0.2-0.5 ug instead of 2-5 
ug); and fewer steps are involved (no preligation, end repair, chemical extraction, or agarose gel 
electrophoresis and elution are needed 

1 6 Irrespective of the manner in which the nucleic acid fragments are obtained or prepared, it is 

important to denature the DNA to give single stranded pieces available for hybridization. This is 
achieved by incubating the DNA solution for 2-5 minutes at 80-90°C. The solution is then cooled 
quickly to 2°C to prevent renaturation of the DNA fragments before they are contacted with the 

20 chip. Phosphate groups must also be removed from genomic DNA by methods known in the art. 

4.22 PREPARATION OF DNA ARRAYS 

Arrays may be prepared by spotting DNA samples on a support such as a nylon membrane. 
Spotting may be performed by using arrays of metal pins (the positions of which correspond to an 

24 array of wells in a microliter plate) to repeated by transfer of about 20 nl of a DNA solution to a 
nylon membrane. By offset printing, a density of dots higher than the density of the wells is 
achieved. One to 25 dots may be accommodated in 1 mm 2 , depending on the type of label used. By 
avoiding spotting in some preselected number of rows and columns, separate subsets (subarrays) 

28 may be formed. Samples in one subarray may be the same genomic segment of DNA (or the same 
gene) from different individuals, or may be different, overlapped genomic clones. Each of the 
subarrays may represent replica spotting of the same samples. In one example, a selected gene 
segment may be amplified from 64 patients. For each patient, the amplified gene segment may be in 

32 one 96-weII plate (all 96 wells containing the same sample). A plate for each of the 64 patients is 
prepared. By using a 96-pin device, all samples may be spotted on one 8x12 cm membrane. 
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Subarrays may contain 64 samples, one from each patient. Where the 96 subarrays are identical, the 
dot span may be 1 mm 2 and there may be a 1 mm space between subarrays. 

Another approach is to use membranes or plates (available from NUNC, Naperville, Illinois) 
4 which may be partitioned by physical spacers e. g. a plastic grid molded over the membrane, the grid 
being similar to the sort of membrane applied to the bottom of multiwell plates, or hydrophobic 
strips. A fixed physical spacer is not preferred for imaging by exposure to flat phosphor-storage 
screens or x-ray films. 

8 The present invention is illustrated in the following examples. Upon consideration of the 

present disclosure, one of skill in the art will appreciate that many other embodiments and variations 
may be made in the scope of the present invention. Accordingly, it is intended that the broader 
aspects of the present invention not be limited to the disclosure of the following examples. The 

12 present invention is not to be limited in scope by the exemplified embodiments which are intended 
as illustrations of single aspects of the invention, and compositions and methods which are 
functionally equivalent are within the scope of the invention. Indeed, numerous modifications and 
variations in the practice of the invention are expected to occur to those skilled in the art upon 

16 consideration of the present preferred embodiments. Consequently, the only limitations which 
should be placed upon the scope of the invention are those which appear in the appended claims. 

All references cited within the body of the instant specification are hereby incorporated by 
reference in their entirety. 

20 5.0 EXAMPLES 

5.1 EXAMPLE 1 

Novel Nucleic Acid Sequences Obtained From Various Libraries 

A plurality of novel nucleic acids were obtained from cDNA libraries prepared from various 
24 human tissues and in some cases isolated from a genomic library derived from human chromosome 
using standard PCR ? SBH sequence signature analysis and Sanger sequencing techniques. The 
inserts of the library were amplified with PCR using primers specific for the vector sequences 
which flank the inserts. Clones from cDNA libraries were spotted on nylon membrane filters and 
28 screened with oligonucleotide probes (e.g. , 7-mers) to obtain signature sequences. The clones were 
clustered into groups of similar or identical sequences. Representative clones were selected for 
sequencing. 

In some cases, the 5 f sequence of the amplified inserts was then deduced using a typical 
32 Sanger sequencing protocol. PCR products were purified and subjected to fluorescent dye 

terminator cycle sequencing. Single pass gel sequencing was done using a 377 Applied Biosystems 
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(ABI) sequencer to obtain the novel nucleic acid sequences. In some cases RACE (Random 
Amplification of cDNA Ends) was performed to further extend the sequence in the 5 ' direction. 

4 5.2 EXAMPLE 2 

Novel Contigs 

The novel contigs of the invention were assembled from sequences that were obtained from 
a cDNA library by methods described in Example 1 above, and in some cases sequences obtained 
8 from one or more public databases. The sequences for the resulting nucleic acid contigs are 

designated as SEQ ID NO: 1 -1 390 1 and are provided in the attached Sequence Listing. The contigs 
were assembled using an EST sequence as a seed. Then a recursive algorithm was used to extend 
the seed EST into an extended assemblage, by pulling additional sequences from different databases 

1 2 (/. e. , Hy seq 5 s database containing EST sequences, dbEST version 1 14, gb pri 114, and UniGene 
version 101) that belong to this assemblage. The algorithm terminated when there was no 
additional sequences from the above databases that would extend the assemblage. Inclusion of 
component sequences into the assemblage was based on a BL ASTN hit to the extending 

1 6 assemblage with BLAST score greater than 300 and percent identity greater than 95%. 

Table 3 sets forth the novel predicted polypeptides (including proteins) encoded by the 
novel polynucleotides (SEQ ID NO: 1 -13901) of the present invention, and their corresponding 
nucleotide locations to each of SEQ ID NO: : 1-13901. Table 3 also indicates the method by which 

20 the polypeptide was predicted. Method A refers to a polypeptide obtained by using a software 

program called FAST Y (available from http ://fasta.bioch. virginia.edu ) which selects a polypeptide 
based on a comparison of the translated novel polynucleotide to known polynucleotides (W.R. 
Pearson, Methods in Enzymology, 1 83 :63-98 (1 990), herein incorporated by reference). Method B 

24 refers to a polypeptide obtained by using a software program called GenScan for human/vertebrate 
sequences (available from Stanford University, Office of Technology Licensing) that predicts the 
polypeptide based on a probabilistic model of gene structure/compositionalproperties (C. Burge 
and S. Karlin, J. Mol. Biol., 268:78-94 (1997), incorporated herein by reference). Method C refers 

28 to a polypeptide obtained by using a Hyseq proprietary software program that translates the novel 
polynucleotide and its complementary' strand into six possible amino acid sequences (forward and 
reverse frames) and chooses the polypeptide with the longest open reading frame. 

The nearest neighbor results for SEQ ID NO: 1-13901 were obtained by a BLASTP 

32 version 2.0al 1 9MP- WashU search against Genpept release 120 and Geneseq database release 
200101 (Derwent), using BLAST algorithm. The nearest neighbor result showed the closest 
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homologue for SEQ IDNOS: 1-13901. The nearest neighbor results for SEQ ID NO: 1-13901 
are shown in Table 2 below. 

Tables 1 , 2 and 3 follow. Table 1 shows the various tissue sources of SEQ ID NOS: 1 - 

4 13901. Table 2 shows the nearest neighbor result for the assembled contig. The nearest neighbor 
result shows the closest homolog with an identifiable function for each assemblage. Table 3 
contains the start and stop nucleotides for the translated amino acid sequence for which each 
assemblage encodes. Table 3 also provides a correlation between the amino acid sequences set forth 

8 in the Sequence Listing, the nucleotide sequences set forth in the Sequence Listing and the SEQ ID 
NO.inUSSN 09/515,126 
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TABLE 1 



Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 


adult brain 


GIBCO 


AB3001 


83 544 597-598 600-607 616 841 1004 1148 1346 1493 1974 
2138 2141 2143 2161 22662345 2363 2511 2569 2876 2880 
3001 3099-3101 3105-3106 3110-3111 3115-3117 3199 3272 
3282 3284 3356 3425 3537 3634 3689 3709 3797 3810 3839 
3899 4006 4021-4022 4025 4043 4194 4201 4253 4277 4297 
4388 4399 4410 4667 4671 4722 4747-4748 4750 4755 4767 
4845 4865 4940 5037 5075 5093 5118 5163 5171-5172 5268 
5481 5523 5553 5656 5724 5894 5902 5938 6052 6170-6173 
6176 6214 6307 6336 6369 6374 6793 6894-6897 6979 7058 
7169 7455 7492-7493 7495-7499 7501 7504 7577 7586 7761 
7792 7864 7870 803 5 8065 8085 8110 8120 8140 8224 8226 
8298 8372 8427 8452 8456 8535 8648 8672 8674-8679 8681- 
8684 8816 8838-8839 8870 8898 9012 9041 9079 9128 9257 
9264 9304 9317 9460 9503 9517 9567 9623 9734 9781 9792- 

10778 10786 10895 10984-10986 11032 11052 11069 11130 
11145 11239 11289 11402 11818 11862 11870-11876 11878- 
11881 12017 12037 12127 12160 12294 12363 12375 12405 
12424 12438 12467 12539 12570 12590 12615-12616 12618 
12685 12688 12712 12739 12748 12830 12913 12916 12948- 
12950 13002 13064 13073 13083 13141 13150 13153 13164- 
13166 13257 13391 13456 13479 13489 13492 13494 13499 
13501 13503 13560 13595-13596 13627 13645 13679 13782 
13795 13861 13866 13869 13882 


adult brain 


GIBCO 


ABD003 


67 83 142 443 587 598 608-609 611 613-624 633 731 734 737- 
742 760 799-800 809 1148 1152 1167-1184 1193 1346 1433 
1516 1552 1575 1671 1756 1774 1833 1974 2138 2145 2176- 
2178 2237 2266 2299-2301 2303-2306 2343 2363 2412 2444 
2449 251 1 2516 2555 2569 2576 2614 2716 2809 2876 291 1 
2926 3001 3093 3114 31193121-31243126 3128-31303234 
3254-3256 3258-3263 3265-3267 3270-3274 3276-3277 3280- 
328 1 3284 3286 3348 3356 3378 3435 3459 3484 3537 3548 
3595 3605 3625 3627 3634 3686-3697 3700 3702 3709 371 1 
3720 3722 3737 3757 3797 3804 3810 3839 3856 4006 4019 
4025 4040 4055 4057-4058 4060 4078 4194 4201 4246 4253 
4277 4282 4390 4405 4412 4431 4620 4622 4641 4689 4751- 
4764 4791 4808 4837 4845 4847-4849 4852-4858 4860-4862 
4864-4869 4940 4957 4962 4972 4998 5021 5031 5037-5038 
5040 5076 5093 5108 5118 5167 5169 5171-5172 5251-5261 
5263-5265 5270 5364 5401 5481 5492 5521 5523 5535 5656 
5674 5693 5766 5788 5817 5906-5909 5938 6005 6027 6057 
6064 6147 6178 6180-6182 6189 6214 6229-6233 6254 6272 
6369 6371 6421-6426 6555 6595 6598 6601 6799 6803 6825 
6836 6886 6894 6913 6972 6995 7058 7104 7130 7133 7148 
7164 7169 7339 7347 7386 7426 7455 7494 7502 7507 7509 
751 1-7512 7516 7520 7584-75877590-7596 7598-7601 7603- 
7604 7608 7632 7677 7743 7748 7761 7768 7792 7797 7807 
7815 7839 7849-7861 7864 7870 7930 7937 8035 8065 8067 
8080 8087 8095 8110 8120 8139-8140 8209 8224 8226 8235 
8246 8262 8285 8298 8320 8323 8336 8354 8361 8365 8370 
8375 8387 8452 8456 8535 8556 8576-8577 8603 8630 8648 
8674 8685-8686 8688-8690 8693 8695 8702 8712 8742 8760- 
8761 8763-8764 8766-8769 8813 8815-8816 8830 8834 8838- 
8839 8848 8863 8870 8898 8921 $943-8944 8951 8989 9010 
9041 9050-9056 9058-9064 9076 9079 9092 9097 9128 9144- 
9145 9257 9264 9271 9278-9279 9304 9315 9317 9455 9466 
9472 9475 9480 9503 951 1 9517 9525 9539 9689 9734 9773 
9781 9791 9799-9802 9847 9852 9873 9928-9929 9964 9999- 
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10001 10003-10004 10053 10175 10267 10276 10330 10349 
10434 10449 10466 10471-10474 10492 10496 10509 10566 
10578 10597-10599 10645 10679 10788 10891 10948 10988- 
10990 11032 11039 11041-11043 11052 11065 11069 11105 
11108 11130 11145 11167 11196 1 1203-1 1207 1 1209 11239 
11399 11401-11402 11406 11459 11470 11604 11606 11642 
11761 11818 11862 11877 11882-11884 1 1886 11889-11893 
11944-11946 11981 11988 12016 12019 12022 12037 12083 
12127 12343 12164-12165 12168-12171 12178 12195 12236 
12265 12305 12327 12363 12375 12405 12423-12424 12430 
12438 12546 12570 12590 12594 12612 12615-12618 12630 
12670 12674 12685-12688 12693 12704 12706-12707 12748 
12772 12830 12885 12904 12913 12916 12923 12933 12951 
12956 12993 13001 13020-13021 13038 13047 13064 13072- 
13073 13084-13085 13092 13117 13142 13167-13171 13191 
13254 13257 13260 13295 13390-13391 13394 13456 13479 
13483 13489 13497 13501 13503 13505-13507 13512 13516 
13546 13551 13555 13575 13590 13592 13597 13613-13614 
13645 13649 13659 13711 13782 13795 13838 13861 13869 
13875 13882 13884-13885 13888 13892 13896 


adult brain 


Clontech 


ABR001 


142 858 1542 21742407 2483 2652 3272 3287 3460 3492 3535 
3595 3 737 3839 4005 4060 4282 4434 4791 4972 5040 5293 
5523 5530 5535 5788 5906 6082 6601 6799 6980 7373 7577 
7587 7759 7788 7851 8081-8082 8110 9167 9455 9466 9781 
9928 10422 10774 10791 11069 1 1401 11406 11459 11604 
11607 11791 11818 11865 11961 11979 12022 12122 12160 
12327 12442 12594 12615 12640 12670 12705 12935 12957 
12985 13047 13197 13257 13456 13511-13512 13546 13554 
13646 13793 13885 13889 13893 


adult brain 


Clontech 


ABR006 


6 67 1004 1908 3272 3286 3548 401 1 4282 4998 5923 5928 
6374 6730 6815 6867 6890 7067 8365 9264 9729 9780 10776 
11587 11618 12596 12601 12605 12704 12749 12754 12951 
13047 13051 13090 13479 13488 13498-13499 13503 13512 
13575 13882 


adult brain 


Clontech 


ABR008 


6 11 21 41 51 88 142 364 376 579 598 651 736 800 1050 1 148 
1184 1251-1265 1291 1346 1404 1479 1529 1543 1671-1674 
1697 1699-1710 1820 1830 1832-1838 1840 1848-1849 1908 
1914 1919 1927 1957 1964 1974 1976 1978-1979 2005-2006 
2050 2081 2090 2110-21 11 2129 2150 2174 2200 2310 2327 
2342 2408-2410 2420 2444 2449 2461-2467 2484 2490 2499- 
2506 25 1 1 2553 2574 2576 26 1 1 2652 2809 2827 2866 2894 
3032 3207 3 535 3591 3610 3634 3715 3722 3737 3766-3770 
3819 4006-4007 4011 4025 4032 4060 4078 4095 4109 4128 
4143-4155 4182 4194 4247-4257 4277 4282 4294 4296 4310 
4330 4348 4355 4360 4381 4395 4399 4411 4431 4543 4641 
4662 4694 4698 4767 4781 4791 4808 4833 4837 4985 5001 
5022 5040 5075 5094 5108 5163 5303-5306 5308-5314 5320 
5380 5523 5553 5615 5625-5626 5634 5638-5644 5701 5706 
5711 5727-5742 5766 5772 5775 5783 5801 5814 5817 5820- 
5821 5829 5837 5851 5855 5858 5864 5867 5874 5885 5890 
5897 5901 5906 5923 6057 6125 6214 6223 6288 6302 6456- 
6464 6545 6598 6601-6602 6624 6676-6685 6699 6726 6728 
6746-6749 6765 6799 6805 6854 6860 6893-6894 7004-7007 
7049 7076 7078 7081-7083 7105 71 17 71 19 7133 7153 7166 
7431 7579 7708 7768 7849 7900-7905 8018 8083-8084 8095 
8110 8196-8208 8262 8288 8312 8320 8331 8336 8356 8375 
8452 8482 8633 8681 8710 8739 8777 8815 8817 8830 8839 
8963 8965 8983 9010 9097 9100 9102-9108 9111 9128 9142 
9257 9264 9313 9364 9378-9384 9401 9454-9455 9458 9460- 
9464 9503 9509 951 1 9515-9516 9522 9528-9529 9533 9539 
9542 9544 9573 9577 9646 9773 9780 9924 10000 10025- 
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10029 10097 10148 10209 10218-10223 10225-10229 10267 
10275-10276 10284 10292 10296 10303 10313 10326 10329- 
10331 10333-1033410343 10346 10393 10430 10494 10496 
10513 10542 10557 10613-10616 10679 10688 10691 10719- 
10721 10729 10743 10758 10760 10776 10782 10786 10795 
10895 11100 11114 11132 11195 11240 11242-11254 11401 ! 
1 1406 1 1437 1 1454 1 1459 1 1462 1 1466-1 1473 1 1475-1 1476 
11519 11532-11535 11550 11553 11555-1 1556 1 1559 1 1588- 
11589 11606 11615 11618 11621 11627-11628 11633 11761 
11791 11807 11818 11932 11955 12006 12039 12041 12092 
12212-12224 12231 12236 12305 12363 12368 12396-12399 
12405 12424 12439 12442 12465-12466 12468 12477 12479- 
12482 12490 12521 12546 12552-12553 12576-12577 12579 
12582 12585 12590 12601 12609 12617-12618 12636 12658 
12707 12725 12735-12736 12749 12754 12776-12779 12859 
12868 12894 12905 12909 12940 12955 12959 12977 12980 
12990 13002 13004-13005 13020 13035 13038 13042-13044 
13047 13051-13052 13056 13062 13073 13082-13083 13196 
13249 13280 13311-13312 13336 13387 13417 13421-13426 
13436 13445 13456 13458 13479-13482 13488 13490 13494- 
13495 13497-13500 13503 13507 13512 13516 13533 13546 
13554-13555 13590 13613 13630 13649 13659 13670 13678 
13713 13724 13769 13793-13794 13808 13827-13828 13838 
13861 13867-13868 13875 13882 13884-13885 13888-13889 
13893 13896 13898 


adult brain 


Clontech 


ABR011 


1006 1257 3797 4006 4025 5535 6057 7169 7870 8262 8937 
8966 9257 10778 12736 13394 13679 13793 13861 


adult brain 


BioChain 


ABR012 


88 598 1007 1 134 2597 3557 3590 3627 3797 4006 4192 4246 
4282 4391 4940 5523 5535 6288 6338 7138 8110 8898 9076 
9401 9455 9476 10772 11061 11114 12989 13394 13511 13866 


adult brain 


Invitrogen 


ABR013 


598 2614 3191 4355 4391 5523 5788 8085 8486 11513 12521 
12989 13861 


adult brain 


Invitrogen 


ABT004 


40 51 598 1050-1057 1148 1777-1778 1947 1976 2270-2272 
2327 2490 2617 3050 3600-3602 3722 3987 4390-4391 4434 
4543 4689 5031 5157-5159 5167 5169 5466 5505 5682-5683 
5701 5766 5778 5794 5902 6147 6367-6371 6459 6545 6709 
6728 6783 6801 6971 7104 7175 7815 7839 7864 8139 8342 
8345 8355 8363 8372 8452 8633 8963 8975-8976 9012 9133 
9423-9424 9511 9515 9517 9528 9556 9827 9949 10260 10267 
10275 10570-10571 10733 10767 11132 11159 11406 11459 
11932 12009 12092 12109-12111 12127 12283 12428 12511 
12579 12605 12725 12747 12830 12885-12886 12910 12913 
12954 12987-12989 13051 13054 13062 13073 13090 13249- 
13253 13438 13445 13456 13489 13500 13512 13516 13533 
13546 13590 13622 13649 13683-13684 13713 13803 13838 
13861 13866 13896 


cultured 
preadipocytes 


Strategene 


ADP001 


1 134 1346 2343 2614 3272 3426 3610 3720 3839 3885 401 1 
4277 4282 4297 4346 4388 4391 4405 4434 4641 4833 4940 
4985 5018 5030 5040 5163 5167 5523 5581 5778 5788 5794 
5895 5951 6082 6147 6272 6607 7067 7141 8093 8235 8285 
8312 8363 8629 8648 8830 8839 9290 9401 9466 9503 9781 
10346 10470 10776 10795 10971 1 1 108 1 1 170 1 1513 11818 
12034 12037 12046 12093 12375 12387 12405 12424 12570 
12636 12670 12674 12688 12735 12749 12913 12940 13126 
13163 13295 13489 13494 13497 13499 13511 13516 13575 
13652-13653 13866 13888-13889 


adrenal gland 


Clontech 


ADR002 


8 83 142 225 351 443 551 569 731 864 1134 1266-1271 1273- 
1274 1276-1292 1294-1295 1381 1391 1544-1545 1658 1671 
1908 1959 1983 2010 2023 2145 2175 2283 23102328-2334 
2343 2444 2449 2510 2522 2576 3032 3069 3153 3166 3272 
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3378 3416 3548 3625 3709 371 1 3771-3788 3790-3791 3797 
3839 3870 3899 3985 4019 4054 4060 4109 4173 4192 4194 
4201 4253 4277 4282 4389 4391 4395-4396 4431 4667 4687 
4694 4783 4957 4966 4985 4998 5037 5108 5293 5316-5323 
5325-5336 5481 5505 5527 5531-5533 5560 5628 5656 5701 
5766 5865 5874 5902 5939 5979 6020 6052 6057 6227 6288 
6354 6405 6449 6465-6482 6521 6603 6726 6894 6979 7008- 
7011 7155-7156 7204 7604 7634 7845 7871 7906-7912 7915- 
7918 7920-7930 8022 8067 8085-8086 8095 8110 8116 8224 
8262 8363 8365 8412 8520 8535 8554 8699 8742 8831 8870 
8950-8951 9002 9010 9012 9097 9109-9119 9121-9127 9190 
9264 9280 9304 9317 9455 9457 9495 9503 9553 9556 9586 
9709 9780-9781 9895 9927 10007 10030-10032 10034-10035 
10037-10045 10119 10277 10284 10346 10595 10617-10620 
10623-10627 10645 10675 10679 10760-10761 10766 10774 
10782 11108 11194 1 1255 1 1258-1 1270 1 1289 1 1400 1 1406 
11457 11519 11588 11600 11621 1 1626 1 1818 1 1952 12080 
12159 12225 12227-12230 12232-12237 12239-12240 12242- 
12249 12251 12305 12363 12375 12404-12405 12424 12439 
12577 12599 12601 12630 12636 12657-12658 12663 12688 
12693 12706 12713 12715 12735 12749 12754 12780-12787 
12848 12863 12909 12913 12923 12957 12990 13006-13007 
13020-13021 13064 13095 13104 13295 13313 13315 13388- 
13389 13391 13456 13473 13494 13507 13515 13546 13613 
13631 13679 13725-13733 13858 13866 13872 13883-13884 
13888-13889 


adult heart 


GIBCO 


AHR001 


51 83 88 94 221 239 360 366-367 404 410-411 413 415 458- 
459 461 465-468 471 473-478 486 545-546 559 567 616 625- 
630 743-744 799 802-806 808 810-835 837-842 959 1004 1066- 
1085 1134 1178 1184-1193 1346 1512 1516 1546-15471556 
1575 1671 1727 1774 1829 1959 1976-1977 2090 2096 2108 
21 10 2128 2138 2145-2147 2161 2179 2195-2198 2257 2276 
2278-2281 2302 2307-2309 2363 2398-2399 2409 241 1-2412 
2444 2449 2497 25 1 6 2529 2563 2569 2575-2576 2597 2605 
2614 2617 2762 2809 2816 2879-2880 2911-2924 2926 2961 
2978-2980 2985-2986 2993 2995-3002 3032 3042 3051 3058 
3069 3081 3091-3094 3109 3114 3132-3133 3135 3 137-3 138 
3141 3191 3196 3199 3215 3263 3272 3282-3286 3317 3349- 
3350 3353-3359 3361-3362 3364-3367 3370-3393 3396-3399 
3403 3406 3425 3451 3465-3466 3479 3500 3503 3537-3538 
3544 3548 3550 3555 3557 3590-3591 3595 3604 3606-3612 
3614-3621 3623-3627 3634 3689 3697-3698 3701-3709 371 1- 
3713 3720 3722 3737 3757 3797 3839 3885 3898 3988-3989 
3996 4005-4006 4008 4010-401 1 4019 4021-4022 4025 4040 
4043 4054-4055 4058-4060 4078 4183 4192 4194 4201 4246 
4253 4269 4277 4282 4341 4351 4391 4403 4405 4434 4517 
4543 4553 4590 4615 4622 4631 4633-46344641 4654 4664 
4666-4667 4672-4675 4688-4689 4693-4694 4738 4740 4755 
4783 4828 4870-487 1 4905-4906 4909-49 15 491 7-49 1 9 492 1 - 
4924 4926-4927 4940 4957 4972 4985 4991 4998 5001 5030 
5037-5038 5040 5076 5118 5163 5171-5175 5177-5178 5180- 
5182 5266-5268 5380 5422 5481 5490 5492 5506-5507 5523 
5534-5535 5581 5656 5682 5711 5740 5766 5788 5895 5908 
5923-5924 5938 6047 6057 6083 6085 6104 6110-61 1 1 6117 
6147 6184-6185 6189 6195 6215 6255-6257 6259-6266 6268 
6272 6288 6307 6336 6375-6379 6382 6427-6429 6458 6555 
6588 6595 6607 6629 6665 6689 6765 6767 6799 6815 6871- 
6873 6879 6881-6882 6898 6926-6928 6975-6978 6987 7058 
7090 7148 7313-7314 7341 7343-7344 7346-7347 7350 7354 
7359 7362 7373 7380 7394 7402 7407-7408 7410 7413 7415- 
7416 7418-7419 7426 7431 7468 7473 7480 7494 7505 7513- 
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7516 7520 7544 7561 7584 7587 7599 7601-7604 7635 7638- 
7643 7645-7649 7651-7655 7657 7659-7662 7733 7735 7743 
7748 7783-7796 7815 7852 7857 7862-7863 7865-7870 7930 
7933 7983 8062-8065 8067 8087-8088 8093 8095 81 10 81 16 
8120 8139-8140 8224 8226 8235 8262 8298 8336 8344-8345 
8354 8356 8363 8368 8372 8378 8387 8410 8427 8452 8456 
8531-8532 8534-8535 8563-8569 8572 8576-8577 8592-8593 
8597 8603 8606 8610 8613-8614 8616-8617 8646 8648 8670 
8681 8691 8698-8699 8702 8712 8742 8756 8760 8763 8795 
8807-8809 8811-8814 8816-8819 8821-8833 8835-8839 8858 
8863 8870 8898 8921 8927 8936 8939 8943 8946 8950-8951 
8966 8988-8992 8994-8995 9010 9049 9059 9065-9067 9070- 
9072 9076 9097-9098 9167 9190 9257-9260 9262 9269 9281 
9287 9301 9304 9315 9317 9401 9454-9455 9466 9476 9480 
9484 9556 9577 9612 9689 969^ 9720-9721 9734 9741-9743 
9747-9750 9758 9781 9791 9804 9871-9882 9884-9885 9928 
9939 9942 9954-9960 9999-10000 10005 10175 10179 10275 
10284 10292 10296 10329-10331 10346 10400 10422 10430- 
10431 10437 10442 10444-10447 10475 10511-10512 10514- 
10521 10557 10576-10577 10616 10645 10679 10691 10729 
10742 10744 10772 10774 10777-10778 10782 10788 10839 
10891 10894-10895 10902 10917-10920 10937 10942 10946 
10948 10969-10970 10992 11032 1 1044-1 1045 1 1061 11066- 
11074 11108 11114 11132 11145 11153 1 1 165-1 1 170 1 1 173 
11205 11208 11210-11215 11283 11289 11386 11388 11401- 
11402 11406 11462 11559 11565 11576 11596 11606 11615 
11620 11744 11788-11789 11793 11818 1 1823-1 1825 11831- 
11832 11869 11894 11947 11961 11982-1198912000 12006 
12009 12011 12019-12020 12028 12037 12044 12078 12081 
12093 12119-12122 12143 12160 12166 12172-12175 12177- 
12179 12197 12305 12335 12363 12375-1237612383 12387 
12400 12402 12405 12424 12428 12438 12479 12521 12523 
12546 12560 12564 12570 12590 12599 12601 12605 12609 
12611 12616-12618 12653 12662-12663 12670 12674 12688 
12717-12720 12734 12752 12754 12772 12905 12907 12914 
12916-12917 12923 12925 12940 12961 12963-12965 129S9 
12991 12993 13020-13021 13033 13035 13072-13073 13082- 
13083 13104 13117 13126 13132 13136 13141-13142 13148 
13169 13203-13210 13212 13260-13261 13293 13295 13326 
13377 13394 13413 13442 13456 13477 13480 13488 13490 
13494-13496 13501-13503 13506-13507 13511 13516 13533 
13568 13570-13571 13575 13582 13592 13613 13624-13630 
13632 13644 13646 13659-13660 13678-13679 13689 13701 
13711-13713 13775 13782 13795 13797 13866-13869 13872 
13882 13884-13885 13893 


adult kidney 


GIBCO 


AKD001 


49 67 83-84 142 354 405-407 415-429 43 1-432 445 460 462 
479-484 486 488 492-493 524 548-549 598 616 63 1-638 744 
787 809 841 1004 1068 1086 1160 1163-11661171 1184 1193 
1346 1359 1449 1479 1516 1552 1556 1671 1724 1727 1774 
1826 1858 1914 1974 1976 1978-1979 2081 2097-2099 2111- 
2113 2117-2118 2129-2130 2138 2145 2148-2150 2161 2186 
2218 2266 2268 2302 2310 2327 2343 2363 2409 2412 2444 
2449 2468 2483 2523 2569 2576 2614 2617 2827 2845 2876 
2910 2915 2926-2931 2933-2934 2938-2943 2945-2947 2955- 
2956 2976 2981 2983-2984 3001-3016 301 8 3053 3109 31 14 
3140 3142-3146 3149-3150 3199 3254 3265 3272 3283-3284 
3286 3333 3350 3356 3378 3406 3435 3445 3460 3492 3503 
3535 3537 3544 3548 3590-3591 3598 3625 3627 3634 3676- 
3677 3679-3685 3689 3697 3709 3711 3720 3722 3737 3757 
3797 3808 3810 3839 3885 3989 4005-40064011 4019 4022 
4025 4040 4043 4054-4055 4060 4078 4109 4192 4194 4201 
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4246 4253 4269 4277 4341 4354 4387-4388 4390 4398 4402 
4405 4410 4434 4473 4540 4543 4583 4591-4592 4607 4637 
4640-4641 4649 4654 4665 4668 4676-4677 4680-4685 4688- 
4689 471 1-4712 475 1 4758 4766-4768 4783 4808 4833 4836- 
4837 4845 4874 4904 4940 4957 4962-4963 4972 4983 4991 
4998 5022 5037 5040 5076 5093 51 18 5143 5163 5171-5172 
5246-5249 5281 5362 5364 5380 5422 5481 5521 5523 5526- 
5527 5535 5656 5693 5726 5740 5766 5769 5778 5788 5794 
5894 5902 5908 591 1 5916-5917 5923 5928 5938 5990 6005 
6049-6051 6057 6078 6082 6086 6088 6105-6107 6112-6117 
6120-6122 6147 6159 618^-6189 6215 6257 6272 6288 6369 
6418 6423 6430 6555 6595 6598 6738 6743 6765 6767 6793 
6799 6805 6815 6836 6857 6867-6870 6872-6873 6878 6884 
68S6 6893 6899-6900 6931 6947 6976 6979 6988 7024 7045 
7058 7104 7118 7155 7275 7281 7306 7312 7338 7347-7348 
7351-7352 7355 7357 7359-7363 7366 7369 7371-7373 7381 
7383 7386 7400 7402 7404-7406 7408 7420 7422 7424 7426- 
7428 7430-7431 7433-7435 7439 7455 7465-7468 7494 7502 
7506 7517-7521 7579 7587 7604 7634 7639 7642 7733 7735 
7743 7748 7792 7797 7839 7842-7846 7848 7852 7857 7862 
7864 7870 7930 7933 8065 8077 8084 8087 8093 8095 8105 
8110 8116 8120 8139-8140 8156 8224 8226 8235 8262 8320 
8336 8345 8351 8354 8359 8364 8368 8370 8372 8375 8377 
8387 8427 8452 8456 8461 8486 8518 8520 8527 8535 8563- 
8564 8566 8572-8573 8575-8577 8580-8583 8585 8588-8589 
8597 8603 8618-8621 8623 8625-8626 8628 8630 8634 8647- 
8650 8664 8674 8677-8678 8691 8694 8701-8704 8706 871 1 
8722 8740 8742 8763 8792 8798 8806 8812-8813 8816 8830 
8838-8839 8848 8853 8863 8876 8898 8921 8935 8943-8944 
8948 8951 8966 8989 9012 9041 9047-9049 9076 9092 9128 
9137 9167 9218 9244 9249 9257 9264 9287 9304 9315 9317 
9424 9455-9457 9466 9472 9475 9484 9503 9511 9517 9529 
9536 9542 9544 9558 9612 9683 9699 9703 9722-9723 9725 
9734 9744 9752-9755 9758 9772-9773 9780-9781 9805-9807 
9841 9843 9927-9929 9939 9942 9969 9998-10000 10007 
10175 10275 10284 10287 10292 1*0319 10346 10376 10423- 
10424 10431 10442 10446 1 0448 10450 10475 10480 10496 
10542 10557 10645 10679 10774 10778 10782 10788 10895 
10921-10928 10947 10949-10953 10986 10994-10996 1 1032 
11052 11061 11069 11114 11130 11132 11143 11145 11197- 
11202 11205 11208 11212 11239 1 1289 1 1401-1 1402 11406 
11459 11513 11586 11596 11604 11607 11618 11620 11695 
11711 11759 11761 11790-11791 1 1793-1 1794 1 1818 11820- 
11822 11834-1183611857 11865 11869 11895-11899 11950 
11970 12006 12022 12041 12078 12143 12159-12160 12178 
12195 12197 12256 12265 12305 12327 12359 12363 12375 
12387 12405 12424 12428 12438 12442 12467 12511 12521 
12523 12535 12539 12546 12564 12570 12590 12599 12601 
12609-12611 12616 12618 12630 12653 12656 12664 12666- 
12667 12670 12674 12688-12689 12691 12739 12754 12830 
12834 12904 12913-12914 12916-12917 12923 12940-12941 
12951-12952 12956 12972 12976-12977 12993 12999 13002 
13052 13062 13064 13066 13072-13073 13082-13083 13095 
13104 13127 13131 13133 13136 13141 13143-13146 13148 
13154-13155 13169 13172 13184 13195-13196 13249 13254 
13260 13263 13280 13295 13307 13319 13394 13442 13456 
13477 13479-13480 13488-13490 13492 13494-13495 13497- 
13503 13506-13507 13512 13515-13516 13546 13551 13554- 
13556 13572-1357413590 13592 13613 13627 13631 13644- 
13645 13656 13659-13660 13665 13670 13710 13713 13776 
13782 13795 13859-13860 13864 13866-13868 13872 13882- 
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Origin 


Source 


Name 










13885 13888 13891-13892 


adult kidney 


Invitrogen 


AKT002 


67 142 550-552 598 639-641 1004 1015 1493 1516 1947 2215 
2299 2343 2363 2449 26 1 8 3054-3055 3 1 52-3 153 3 1 55-3 1 57 
3434 3535 3590 3709 3797 3808 3839 3885 401 1 4022 4025 
4040 4253 4277 4391 4405 4671 4759 4767 4769 4837 4949 
4972 5001 5035 5037 5052 5108 5526 5581 5615 5726 5788 
5895 6062 6139 6190-6191 6803 6900 6947 6975 7468-7469 
7473 7733 8095 8110 8139 8262 8323 8361 8363 8375 8520 
8539 8648 871 1 8798 8912 8950 8966 8983 9076 9264 936S 
9510 9517 9665 9703 9734 10175 10476 10791 10989 10997 
11132 11618 11745 11900 12006 12039 12160 12363 12375 
12405 12424 12685 12702 12707 12904 13035 13060 13104 
13136 13295 13394 13456 13488 13495 13501 13512 13533 
13554 13583 13644 13670 13679 13696 13713 13795 13866 
13888 13891 


adult lung 


GIBCO 


ALG001 


83-84 553 598 642-644 650 747 975 1004 1009 1015-1022 
1449 1516 1816 2161 2186 2215 2343 2444 2968 3056 3158 
3160-3162 3345 3466 3503 3519 3566 3568-3573 3595 3709 
3797 3810 3839 3885 4025 4039 4059 4194 4405 4622 4755 
4767 4770 4797 4865 4940 4949 4963 4972 4987 4991 4998 
5001 5117-5129 5171-5172 5233 5401 5481 5581 5724 5788 
5938 5960 6123 6125 6140 6215 6322 6336 6343-6348 6371 
6886 6966-6967 7024 7143 7275 7441 7444 7502 7522-7523 
7749-7752 8093 8110 8140 8336 8345 8375 8378 8449 8535 
8612 8622 8635 8648 8651 8674 8702 8707-8708 8838-8839 
8898 8941-8942 8944 8948 8951 9076 9128 9457 9466 9475 
9497 9503 9544 9567 9703 9756 9758 9808-9810 9843 9873 
9926 9999 10161 10175 10275 10329 10344 10400 10458 
10554-10555 10796 10998 11032 1 1046 1 1069 1 1 132 11144- 
11145 11483 11627 11818 11837 11901 12006 12028 12039 
12041 12087-12092 12146 12363 12375 12424 12438 12539 
12570 12601 12617 12661 12670 12674 12688 12738 12749 
12754 12904 12940 12974 13062 13173 13195 13232 13234 
13260 13295 13307 13456 13472 13477 13490 13494 13497- 
13498 13569 13584 13592 13660 13663-13666 13670 13743 
13776 13882 13885 13889 13891 


lymph node 


Clontech 


ALN001 


83 142 364 487 495-497 554 629 645-646 648-650 716 938-951 
953-962 1134 1516 1549 1671 1774 1976 2138 2225-2232 
2234-2235 2340 2879 3019-3020 3057 3103 3163-3165 3272 
3356 3498-3505 3507-3513 3515-3516 3538 3548 3628 3697 
4194 4201 4253 4405 4641 4687-4688 4771-4772 4783 4808 
4845 4963 4972 4987 4998 5042-5065 5076 5163 5504 5523 
5835 5895 5917 6027 6142 6192 6272 6288 6308-6311 6313- 
6314 6765 6805 6871 6949-6951 7700-7705 7707-7709 8085 
8088 8091 8110 8235 8375 8387 8432 8629 8631 8633 8648 
8677 8709 8713-8715 8830 8863 8887-8889 8891-8896 8943- 
8944 8966 9010 9076 91 1 1 9128 9142 9222 9455 9472 9520 
9544 9734 9774 9730 981 1 9905-9907 9928 9939 9999 10027 
10129 10296 10439 10452 10501 10543-10545 10679 10777- 
10778 10788 10891 10999-11000 11111-11113 11115 11130 
11145 11344 11406 11513 11584 11SS5 12006 12028 12050- 
12056 12143 12256 12363 12405 12442 12570 12674 12690 
12913 12917 12940 12968-12970 13173 13220-13221 13394 
13400 13492 13503 13511 13533 13630 13642 13645 13713 
13868 13885 13889 13891 


young liver 


GIBCO 


ALV001 


211 498 598 651-653 1008-1014 1 193 1264 1575 1976-1977 
2131 2161 2254-2255 2269 2363 2568 2617 2627 2633 2636 
2961 3021 3059 3093 3 166 3215 3272 3356 3378 3426 3479 
3559-3565 3590 3597 3627 3634 3673 3709 3797 3810 3885 
3993 4006 401 1 4019 4025 4194 4246 4253 4277 4422 4426 
443 1 4434 4437-4438 4634 4654 4687 4714 4722 4755 4773- 



105 



WO 01/64835 



PCT/US01/04927 



Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








4774 4808 4957 5001 5037 5052 51 13-51 15 5163 5167 5171 
5508 5510 5526 5581 5616 5693 5126 5151 5772 5788 5928 
6125 6141-6142 6193 6288 6340-6342 6589 6765 6815 6886 
6901 7024 71 18 7141 7442 7524 7735 7747-7748 7792 7852 
7870 7924 7930 8066 8110 8120 8262 8375 8378 8518 8520 
8648 8652 8691 8702 8716-8718 8816 8838-8839 8938-8940 
9097 91 19 9263-9264 9301 9317 9424 9454-9455 9544 9689 
9703 9734 9758 9775 9925 9947 10393 10477 10769 10774 
11001 11132 11140-11142 11173 11208 11318 11406 11587- 
11588 11725 11804 11902 12081-12086 12118 12160 12178 
12375 12377 12405 12424 12479 12511 12570 12599 12601 
12618 12670 12674 12688 12737 12830 12913 12916 12933 
12953 12977 12993 13047 13062 13073 13126 13169 13174 
13230-13231 13295 13488-13489 13494 13498-1349913502 
13506-13507 13575 13592 13646 13660-13662 13670 13866 
13868-13869 13882 13888 13901 


adult liver 


Invitrogen 


ALV002 


6 25 60 142 598 1004 1213 1296-1301 1381 1493 1513-1514 
1977 2139 2161 2269 2310 2335 2342 2400 2413 2449 2458 
2497 2568-2569 2576 2636 2809 2827 2880 2926 3471 3484 
3503 3597 3722 3792-3796 3885 3994-3995 4025 4060 4201 
4326 4422 4426 4618 4689 4767 4786 4985 4998 5093 5163 
5337-5339 551 1 5526 5581 5693 5701 5724 5726 5757 5895 
5922-5923 5979 6020 6027 6083 6125 6192 6195 6253 6333 
6346 6483-6485 6716 6765 6797 6975 7169 7422 7468 7614 
7642 7807 7932 8089 81 10 8140 8262 8323 8378 8633 8677 
871 1 8740 8759 8786 8834 8839 8944 9002 9128-9129 9264- 
9265 9282 9466 9484-9485 9517 9677 9700 9773 10007 10046- 
10047 10135 10616 10669 10777 1 1032 1 1 1 14 11132 11142 
11194 11271 11389 11462 11502 11560 11587 11602 11818 
11980 11989 12086 12160 12195 12253-12254 12316 12327 
12363 12378-1237912411 12424 12511 12570 12630 12693 
12706 12788-12789 12840 12863 12913-12914 13047 13062 
13072-13073 13090 13242 13256 13378-13379 13389 13479 
13487 13489 13497 13555 13734-13738 13859 13864 13889 


adult liver 


Clontech 


ALV003 


346 2529 3548 3797 3885 4438 4940 5101 5801 5902 9597 
10769 10778 11587 11927 12086 


adult ovary 


Invitrogen 


AOV001 


6 13 40 67 83 88 142 196 444 522 555-560 577 579 598 609 
654-657 659-665 667-668 708 740 745-749 751-754 756-761 
806 841 871 1004 1023 1028-1034 1036-1039 1041-1047 1055 
1087-1115 1119 1151-1162 1298 1346 1359 1456 1493 1507 
1516 1658 1697 1752 1774 1812 1826 1848 1914 1974 1976- 
1977 1979 1983 2099 21 1 1 2138 2153-2155 2161 2175 2180- 
2181 2186 2258-2267 2284-2286 2288-2290 2298-2299 2342- 
2343 2449 2483 2523 2529 2555 2569 2576 2591 2597 2618 
2708 2750 2783 2818 28392926 2961-2962 2988 3007 3022 
3 03 1 3060-306 1 3 063 3 069 3 1 66-3 1 72 3 1 74-3 1 75 3 1 77-3 1 89 
3226 3272 3286 3288-3289 3291-3293 3295-3299 3435 3450 
3460 3479 3486 3503 3535 3577-3589 3612 3627 3629 3631- 
3643 3673 3675 3720 3722 3737 3797 3806 3810 3812-3813 
3839 3885 3985 4006 4012-4014 4023 4025 4040 4043 4060 
4078 4133 4192 4194 4201 4246 4269 4277 4282 4341 4362 
4381 4383 4385 4388 4390-4391 4396 4399 4402 4405 4410 
4434 4465 4473 4515 4543 4582 4610 4654 4689 4694 4715- 
4717 4722 4759 4767 4775-4782 4836-4837 4873-4878 4880 
4940 4944 4957 4960 4962-4964 4972 4975 4981 4985 4998 
5001-5002 5018 5030 5037 5040 5045 5075-5076 5093 5108 
5135-5136 5138-5146 5148 5163 5191-5209 5213 5238-5245 
5262 5374 5380 5451 5455 5490 5503 5505 5521 5523 5527 
5560 5581 5674 5701 5724 5757 5778 5783 5788 5794 5817 
5874 5894-5895 5902 5906 5908-5909 5916-5917 5923 5932 
5938 5979 5989-5990 6005 6027 6048 6054 6057 6061 6078 
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6082 6086 6125 6140 6143-6144 6165 6171 6192 6194-6200 
6215 6235-6239 6241 6270 6288 6307 6333 6336 6354-6360 
6362-6364 6374 6385-6392 6410 6412 6415-6417 6458 6461 
6508 6555 6583 6595 6598 6604 6607 6624 6626 6629 6708 
6765 6767 6793 6797 6799 6801 6805 6825 6860 6870-6871 
6873 6886 6888 6900 6902-6904 6915-6917 6947 6969-6970 
6975 6980-6987 6998 7054 7058 7104 7109 7118 7133 7137 
7141 7155 7166 7169 7171 7191 7204 7281 7386 7408 7431 
7443 7468 7471-7472 7494 7507 7525-7528 7604-7609 7614 
7642 7687 7716 7729 7762-7763 7765-7767 7769-7771 7773- 
7775 7788 7802-7818 7829-7835 7838-7841 7864 7905 7937 
7977 8007 8035 8077 8088 8093 8095 8110 8120 8134 8139- 
8140 8143 8156 8235 8246 8262 8292 8320 8336 8345 8355 
8358 8363 8365 8368 8370 8372 8375 8387 8444 8452 8456 
8486 8520 8593 8610 8633-8634 8653 8719-8724 8773-8777 
8779 8781-8782 8792 8798 8813 8830 8839 8853 8863 8870 
8876 8898 8928 8944 8950 8956-8961 8963-8971 8989 9004- 
9007 9009-9019 9042-9046 9055 9076 9097 9128 9134 9185 
9264 9283-9284 9291 9313 9358 9424 9445 9454-9455 9457 
9460 9466 9471-9472 9475 9480 951 1 9517 9533 9539 9542 
9544 9553 9565 9577 9586 9612 9626 9677 9694 9703 9722 
9734 9757-9758 9773 9776-9778 9791 9812-9819 9827 9838 
9843 9848-9853 9922 9927-9940 9942 9962-9970 9989-9992 
9994-9997 10007 10027 10097 10149-10152 10249 10252 
10275 10298 10333-10334 10346 10349 10351 10383 10386 
10417 10438-10439 10452 10459 10470 10478-10480 10493- 
10498 10557-10562 10564-10565 10580-10586 10594-10596 
10645 10658 10676 10679 10767 10772 10774 10778 10782 
10788 10791 10954 10971-10972 11002-11004 11032 11047- 
11052 11061 11069 11083 11108 11130 11 132 11151-11153 
11155 11157-11158 11167 11170 11172-1 1176 11193-11196 
11274 11343 11369 11406 11411 11431 11588 11596 11600 
11604 11606-11607 11618 11620-11621 11629 11668 11814 
11860 11865 11876 11903-11909 1 1932 1 1949-1 1954 1 1980 
11985 12000 12006 12017 12022 12033 12039 12080 12083 
12092-12093 12095-12104 12127-12129 12131-12142 12146 
12159-12160 12162-12163 12178 12213 12216 12236 12256 
12260 12305 12327 12363 12368 12371 12375 12379 12387 
12401-12402 12405 12424 12430 12467 12520 12522 12546 
12570 12576 12590 12594 12599 12605 12609 12611 12615 " 
12617-12618 12630 12636 12643 12657-12658 12663 12670 
12674 12685 12688 12691 12693 12702 12705-12707 12713 
12724 12729 12735 12740-12745 12749 12754-12760 12765- 
12770 12777 12842 12848 12875 12904 12906 12910 12913 
12916-12917 12935 12940 12955 12957 12963 12972 12977- 
12980 12982-12984 12992-12994 12997-13000 13020 13034 
13047 13051 13054 13056 13060 13062 13066 13070 13072- 
13073 13075 13082 13090 13092-13093 13104 13126 13136 
13141 13175-13177 13179-13180 13193-13194 13196-13197 
13202 13236-13241 13243 13245 13249 13254 13263-13275 
13277 13286-13292 13295 13319 13351 13377 13389-13391 
13394 13420 13436 13456 13477 13479 13488 13492 13494- 
13495 13497-13499 13502-13503 13506 13512 13516 13533 
13546 13549 13554-13555 13575 13590 13597 13600-13601 
13613 13616-13620 13627 13631 13644-13645 13649 13659- 
13660 13670 13674-13679 13693-13699 13707-13709 13713 
13782 13803 13864 13866 13868-13869 13872 13875 13882- 
13885 13888-1388913892 


adult placenta 


Clontech 


APL001 


669-671 1006 1134 1184 1551 2053 2090 2156-2158 2342 
2490 2716 3001 3064-3065 3190 3272 3625 3670 4019 4055 
4194 4201 4246 4641 4718 4720-4721 4783-4786 4957 5523 
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5536 5581 5788 5895 6145-6146 6201-6202 6358 6803 7049 
7204 7529 8367 8375 8609 861 1 8630 8725-8727 9097 9480 
9734 9820 10319 10460 10579 10795 10895 11519 11723 
11737 11776 11827-11828 11959 12403 12546 12570 12663 
12848 12875 12913 13051 13394 13782 13869 13889 


placenta 


Invitrogen 


APL002 


142 561 1023 1551 1976 2449 2569 2614 3191 3340 3592 3668 
4060 4346 4767 4787-4789 4892 4985 5536 5674 5693 5757 
5772 5794 5906 6147 6203 6215 6598 6788 6799 6979 7158 
7530-7531 8139 8361 8611 8863 9457 9484 9517 9563 9703 
9758 9821 9927 9939 9942 10481 10557 10778 10802 10973 
11006-11011 11723 11731 11737 11776 11807 11866 11910- 
11916 11959 12379 12403 12590 12685 13020 13054 13392 
13489 13533 13554-13555 13867 13882 13888 13891 


adult spleen 


GIBCO 


ASP001 


463 499 562-563 598 672 990-993 995 1004 1020 1346 1515- 
1516 1556 1774 1877 1914 1977 2159 2161 2215 2248 2257 
2363 2529 2569 2614 2617 2812 2827 3093 3 192 3195 3199 
3272 3286 3350 3356 3434 3470 3503 3537 3544-3546 3548 
3604 3634 3689 3709 3720 3722 3796-3798 3812 3839 3885 
4006 401 1 4022 4025 4043 4194 4201 4246 4253 4277 4388 
439 1 43 96 4405 4434 464 1 4689-4690 478 1 4957 4987 4998 
5001 5017 5030 5037 5052 5076 5118 5512-5513 5523 5526 
5701 5740 5778 5788 5796 5801 5895 5938 6005 6064 6125 
6140 6147 6192 6204 6272 6329 6458 6551 6590 6607 6873 
6886 6931 6958 7018 7058 7109 7118 7133 7171 7233 7506 
7532-7533 7561 7586 7733 7736 7807 7842 7845 7933 8065 
8085 8093 8095 8110 81 16 8139-8140 8226 8235 8262 8323 
8359 8363 8365 8368 8372 8456 8535 8648 8702 8792 8816 
8838-8839 8858 8863 8865 8876 8923-8928 8943 8950-8951 
9128 9257 9264 9317 9455 9457 9466 9474 9544 9560 9773 
9781 9919-9920 9927 9939 9942 10000 10048-10049 10155 
10175 10267 10275 10296 10331 10549 10566 10774 10777 
10828 10944 11032 11061 11108 11113 11132 11173 11272 
11401 11406 11620-11621 11804 11818 11861 11917 12006 
12039 12041 12073-12074 12078 12118 12127 12236 12327 
12375 12387 12402 12405 12424 12426 12511 12546 12560 
12570 12601 12617-12618 12670 12674 12734-12735 12749 
12772 12790 12913 12916-12917 12972 12977 13002 13021 
13066 13082 13084-13085 13169 13228-13229 13256 13260 
13295 13316-13317 13394 13456 13488-13489 13494 13497 
13512 13516 13546 13645 13649 13651 13659 13679 13739 
13776 13864 13866-13867 13888 


testis 


GIBCO 


ATS001 


50 142 500-502 564-565 598 673-678 963-968 1 193 1346 1556 
1671 1826 1968 1977 2160-2161 2236-2237 2555 2590 2597 
3024 3093 3196-3202 3284 3378 3517-3522 3524 3720 3797 
3839 3885 4006 4025 4055 4109 4253 4354 4377 4388 4405 
4434 4473 4480 4543 4688 4792 4940 4957 4962 4964 4975 
500 1 50 1 8 5030 503 7 5067-5072 5076 5 1 1 8 5 1 7 1 548 1 5525 
5535 5740 5757 5788 5907 5924 6125 6147 6257 6316-6322 
6324 6607 6886 6952 7058 7141 7288 7445 7484 7534-7540 
7677 7710-7711 7713-7717 7735 7792 7815 8087 8110 8120 
8262 8368 8370 8375 8535 8577 8648 8655 8702 8729 8838- 
8839 8848 8898-8906 8936 8939 8966 9076 9087 9244 9264 
9304 9455 9466 9485 9542 9567 9703 9758 9822-9825 9908- 
9910 9928 10462 10475 10478 10482 10551 10583 10679 
10772 10778 10795 10955 10974 11108 11113 11116 11118- 
11120 11132 11173 11239 11425 11606 11620 11629 11695 
11807 11862 11918-11919 12000 12006 12017 12033 12057- 
12061 12375 12405 12424 12570 12577 12599 12601 12605 
12609 12668 12692 12730-12731 12748 12904 12917 12923 
12929 12935 12940 12956 12971 13047 13051 13073 13090 
13169 13222-13225 13249 13394 13442 13456 13479 13494 
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13497 13502-13503 13506 13512 13515 13533 13549 13555 
13575 13613 13616 13627 13644-13646 13660 13866 13875 
13882 13889 


Genomic 
DNA from 
BAC 63118 


Research 

Genetics 

(CITB 

BAC 

Library) 


BAC001 


8711 


adult bladder 


Invitrogen 


BLD001 


731 1710 1779-17912076 2367 2479-2481 4067 4208-4214 
4681 4767 4775 5163 5169 5553 5560 5581 5674 5684-5688 
5794 6082 6244 6716 6718-6721 6799 6931 7045 7094-7097 
7967 81 10 8226 8249-8258 8364 8648 8737 9012 9097 9425- 
9428 9430 9626 9703 9928 10007 10261 10557 10734-10735 
11459 31504-1150612000 12212 12512-12513 12515-12517 
12670 12735 12914 13085 13439 13512 13687 13838-13841 
13866 


bone marrow 


Clontech 


BMD001 


11 70 83 85 142 150 162-184 186-198 200-210 230-243 245- 
277 279-281 306 370 373-387 389 433-435 438-440 457 483 
503-510 566 568-569 595-596 598 609 616 679-683 740 770 
843-857 859 861 864 867-876 880-884 886-890 892-893 896 
900 902-920 1000 1 004 1 1 16-1 120 1 122-1 126 1 128-1 134 3 3 84 
1211 1346 1359 1516 1552-1554 1575 1583 1671 1724 1774 
1877 1894 1927 1974 1976 1983 2012-2024 203 1 2033-2038 
2040 2043-2044 2084-2088 21 1 1 2120-2121 2132-2133 2137- 
2138 2161 2163-2164 2186 2189 2199-2200 2202-2203 2205- 
2210 2213-2218 2266 2291-2295 2342-2343 2414-2416 2444 
2529 2555 2566 2569 2575-2576 2591 2597 2652 2681-2709 
271 1-27 16 2735-2738 2740-2744 2746-2748 2750-2756 2758- 
2759 2761-2764 2766-2768 2770 2772-2781 2783-2787 2806 
2812-2813 2816 2873 2875 2880-2881 2883-2885 2887-2897 
2899 2901 2926 2948 2954 2958 2977 2984 3026-3029 3032 
3068-3069 3071 3081 3093 3096-3097 3139 3203-3205 3207 
3238 3257 3272 3282-3283 3286 3317 3345 3356 3404-3418 
3422-3434 3437-3438 3440 3442 3447 3449-3450 3456-3459 
3461-3464 3466-3473 3483 3497 3535 3538 3548 3557 3572 
3588 3593 3600 3604-3605 3610 3612 3625-3627 3634 3644- 
3647 3649-3651 3653-3657 3709 3711 3722 3725 3727 3737 
3797 3804 3808 3810 3839 3899 4005-4006 401 1 4015-4019 
4023 4025 4040 4043 4058 4060 4129 4132 4192 4194 4201 
4246 4253 4277 4282 4367 4403 4466-4472 4474-4477 4479- 
4484 4486-4490 4492-4497 4509 4512-4540 4582 4595 4597- 
4606 4608-4615 4622 4642 4648 4650 4654 4667 4691-4695 
4723 4741-4742 4759 4767 4783 4794-4796 4808 4836 4928- 
4929 493 1-4934 4938 4940 4942 4944-4948 4950-4953 4955- 
4960 4962-4964 4967-4971 4973 4976-4979 4985-4987 4992- 
4998 5000 5004-5016 5030 5037 5052 5075-5076 5093 5143 
5149-5151 5163 5169 5210-5215 5217-5226 5262 5317 5357 
5503 5523 5535 5537-5540 5560 5604 5695 5740 5748 5766 
5788 5796 5801 5862 5874 5895 5906 5908 5938 5965-5967 
5969-5979 5991-5994 5996-6005 6007-6009 6027 6047 6057 
6065 6067-6068 6071-6072 6082 6086 6089-6091 6112 6125- 
6127 6148-6152 6195 6214 6233 6257 6269-6276 6278-6280 
6282-6283 6285-6289 6292-6300 6321 6374 6393-6403 6508 
6555 6605 6607 6722 6730 6788 6815-6822 6826-6832 6836 
6859-6861 6867 6870-6871 6873-6874 6905 6915 6929 6933- 
6937 6940-6942 6944-6945 6988-6992 7051 7126 7155 7166 
7169 7199-7210 7212-7219 7226 7233-7253 7256-7264 7275 
7317-7318 7320 7322-7326 7347 7349 7373-7374 7379 7382 
7446-7448 7473-7474 7491 7502 7542 7544 7569 7577 7579 
7663-7671 7674 7677-7678 7680-7684 7686 7689-7696 7742 
7768 7791 7815 7819-7824 7847 7864 7930 7995 8065 8085 
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8088 8090-8093 8095 81 10 8120 8134 8139 8224 8226 8235 
8241 8246 8262 8336 8354 8359 8363 8365 8368 8375 8421- 
8427 8429-8440 8444 8452 8456-8460 8463-8465 8467-8476 
8478-8489 8538-8539 8542-8551 8563 8566 8577 8593-8595 
8597 8603 8633 8635-8637 8648 8656-8657 8671 8677 8679 
8701 8792 8796 8817 8830 8840-8845 8847-8849 8852 8855- 
8860 8863-8868 8870 8873 8875 8879-8881 8898 8935 8944 
8963 8965 8995 9010 9012 9020-9031 9049 9051 9066 9076 
9092 9097-9098 9128 9257 9264 9285-9288 9304 9317 9339 
9381 9399 9401 9454-9455 9466 9471-9472 9474-9476 9516- 
9517 9519 9529 9544 9622 9630-9643 9646 9650-9665 9689 
9705-9712 9730 9734 9740 9758 9779-9781 9826 9885-9889 
9892-9902 9928-9929 9939 9942 9972-9976 9978-9982 9999- 
10000 10068 10153 10175 10249 10275 10277 10284 10287 
10296 10319 10321 10343-10344 10346 10364-10374 10379- 
10382 10385-10394 10414 10416 10426 10429 10439 10447 
10452 10467-10468 10475 10483 10520 10522-10526 10528- 
10529 10531-10535 10587-10590 10595 10616 10677-10679 
10691 10750 10760 10767 10772 10778 10788 10794-10795 
10828 10831-10837 10846-10847 10849-10851 10853-10858 
10891 10895 10897 10899-10903 10905-10907 10931-10933 
10956-10959 10975-10976 11012 11032 11046 11057 11059 
11061 11077-11082 11085-11086 11088-11098 11108 11113 
11132 11145 11177-11181 11194 1 1208 1 1289 1 1344 11401- 
11402 11513 11618 11620 11631 11673-11675 11677-11690 
11695 11697-11704 11706 11708-11715 11736 11742 11762- 
1 1772 1 1774 1 1795-1 1796 1 1802-1 1804 1 1817-11818 1 1829 
11839-11842 11863 11920 11946 11992-12002 12004-12008 
12010 12012-12013 12015-12019 12022-12029 12033 12041 
12081 12142-12149 12160 12178 12195 12200 12231 12236 
12283 12305 12308 12316 12327 12363 12368 12404-12405 
12424 12426 12430 12439 12527 12546 12570 12576 12590 
12608 12610 12616 12634-12636 12641-12645 12654-12655 
12657-12659 12669-12670 12674 12721-12724 12754 12761 
12834 12840 12842 12904-12905 12910-12911 12913 12916- 
12917 12922-12923 12925-12928 12933 12938 12956 12966 
12977 12989-12990 12993 12995 13020 13033 13047 13060 
13062 13064 13066 13095 13098 13103-13105 13107 13111- 
13113 13129 13135 13137 13141 13150 13163 13173 13181 
13213-13217 13234 13276-13278 13280 13391 13394 13400 
13456 13472 13477 13490 13492 13494 13498 13501-13502 
13506-13507 13511 13516 13528 13530-13534 13538 13540 
13546 13551-13552 13554-13555 13560 13585-13586 13594 
13613 13630 13633-13639 13644-13646 13649 13659-13660 
13670 13673 13679 13713 13775 13795 13S66 13872 13875 
13882-13885 13889 13891 13893 


bone marrow 


Clontech 


BMD002 


51 242 442 654 1004 1134 1841-1904 1908 1927 2023 2107 
221 5 2342 2408 2507-2529 2576 2597 2806 2866 3286 3434 
3722 3736-3737 3817 3823 3839 4060 4246 4258-42904389 
4396 441 1 4618 4641 4828 4836 4957 4987 5030 5037 5052 
5108 5163 5526 571 1 5743-5772 5774-5804 5813 5895 5939 
6131 6236 6266 6287 6306-6307 6333 6655 6675 6728 6730 
6750-6762 6764-6769 6788 7049 7106-7117 7137 7156 7738 
8068 8085 8134 8156 8290-8297 8299-8305 8307-8312 8323 
8368 8482 8499 8858 8863 8865 8928 8935 9076 9111 9128 
9465-9502 9626 9700 9703 9923 9928-9929 9942 10277-10280 
10282-10284 10286-10297 10396 10434 10515 10551 10645 
10675 10691 10744-10753 10772 10778 11057 11098 11108 
11132 11232 11252 11519 1 1536-1 1553 11606 11620 12033 
12039 12146 12260 12305 12387 12402 12405 12500 12554- 
12578 12594 12599 12608 12674 12754 12777 12839 12895- 
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12904 12916 12923 13020-13021 13043 13057-13064 13169 
13249 13446-13454 13479 13492 13494 13513 13533 13555 
13659 13713 13775 13803 13863-13875 13877-13887 13889 
13891 


bone marrow 


Clontech 


BMD004 


2249 2529 3286 3494 3548 355 1 3797 3839 4025 4058 4201 
4277 4282 5052 5108 6545 6961 8262 8898 9474 10000 1 1098 
11818 13021 13893 


bone marrow 


Clontech 


BMD007 


8539 9780 9927 13021 


adult colon 


Invitrogen 


CLN001 


319 346 487 731 799 1792 1848 2050 2161 2449 24 82-2483 
3431 3901 4215-4217 4940 4957 4987 5163 5239 5560 5689- 
5695 5865 591 1 5923 6722 6765 7098 7815 7864 7880 81 10 
8259-8262 8486 8597 8951 9484 9529 9542 9556 10376 1 1507- 
1 1508 1 1617 1 1869 12127 12236 12424 12518-12523 12601 
12610 12777 12976 13062 13073 13367 13440 13507 13512 
13630 13713 13843-13844 13864 13868-13869 13888 


Mixture of 16 
tissues — 
mRNAs* 


Various 
Vendors* 


CTL016 


6815 10776 12977 13064 13512 


Mixture of 16 
tissues — 
mRNAs* 


Various 
Vendors* 


CTL021 


1671 6738 8432 8648 8863 8944 9511 10769 13021 13062 
13064 


adult cervix 


BioChain 


CVX001 


50 67 142 158 308 332 346 475 598 654 895 1004 1086 1286 
1449 1516 1671 1698 1701 1711-1756 1758-1776 1828 1848 
1959 2134 2186 2257 2267 2343 2408 2414 2468-2474 2476- 
2478 2608 2716 3002 3136 3 166 3191 3199 3529 3535 3554 
3572 3627 3722 3737 3777 3797 3839 3985 4158-4176 4178- 
4195 4197 4199-4207 4246 4277 4391 4396 4434 4641 4667 
4759 4783 4828 4885 4940 4957 4963 4987 4998 5001 5038 
5075 5108 5163 5293-5294 5455 5481 5523 5552 5581 5646- 
5652 5654-5659 5661-5671 5673-5681 5687 5701 5711 5723 
5740 5788 5794 5848 5902 5908 5923-5924 5964 6020 6052 
6057 6062 6091 6106 6112 6125 6129 6181 6350 6371 6374 
6410 6446 6458 6504 6508 6512 6551 6598 6686-6687 6689- 
6705 6707-6715 6788 6873 6893 6917 6998 7008 7045 7078 
7084-7093 7095 7130 7141 7148 7169 7204 7507 7579 7608 
7675 7733 7768 7815 7871 7880 7893 8078 8138 8209-8215 
8217-8236 8238-8242 8244-8248 8298 8345 8370 8444 8456 
8486 8499 8535 8558 8592 8633 8635 8648 8669 8679 8742 
8853 8863 8870 8898 8921 8939 8948 9012 9061 9098 9107 
9128 9137 9153 9304 9308 9317-93 18 9355 9385-9391 9393- 
9403 9405-9406 9408-9418 9420-9422 9457 9466 9475 9510 
9539 9612 9734 9773 9927-9928 9939 9947 9960 101 10 10175 
10230-10256 10258-10259 10267 10274 10319 10329 10344 
10491 10496 10540 10616 10660 10691 10722-10732 10778 
10782 11055 11145 11217 11376 1 1462 1 1477-1 1489 11491- 
11503 11519 11584 11604 11695 11853 11869 11891 11980 
12006 12066 12081 12127 12160 12195 12216 12240 12266 
12308 12363 12379 12402 12405 12424 12438 12483-12494 
12496-12510 12579 12605 12610-12611 12617-12618 12643 
12653 12670 12674 12688 12691 12703 12707 12735 12740 
12754 12830 12840 12866 12870-12881 12883-12884 12905 



* The 16 tissue-mRNAs and their vendor source, are as follows: 1) Normal adult brain mRNA (Invitrogen), 2) 
normal adult kidney mRNA (Invitrogen), 3) normal adult liver mRNA (Invitrogen), 4) normal fetal brain mRNA 
(Invitrogen), 5) normal fetal kidney mRNA (Invitrogen), 6) normal fetal liver mRNA (Invitrogen), 7) normal fetal 
skin mRNA (Invitrogen), 8) human adrenal gland mRNA (Clontech), 9) human bone marrow mRNA (Clontech), 
10) human leukemia lymphablastic mRNA (Clontech), 11) human thymus mRNA (Clontech), 12) human lympfr 
node mRNA (Clontech), 13) human spinal cord mRNA (Clontech), 14) human thyroid mRNA (Clontech), 15) 
human esophagus mRNA (BioChain), 16) human conceptional umbilical cord mRNA (BioChain). 
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12913-12914 12917 12923 12951 12955 12957 12965 12989- 
12990 12993 13002 13020 13035 13045-1305013062 13064 
13072 13092 13136 13141 13174 13197 13254 13283 13307 
13391 13428 13430-13437 13442 13473 13479 13492 13494- 
13495 13497-13498 13502 13532 13554-13555 13575 13590 
13597 13613 13616 13627 13644 13679 13713 13775 13829- 
13837 13866 13868-13869 13872 13884 13888 13891 


diaphragm 


BioChain 


DIA002 


731 1346 3548 371 1 3885 4282 4654 5895 6873 8120 8931 
8936 9455 11132 11818 12405 12609 


endothelial 
cells 


Strategene 

* 


EDT001 


21 51 67 83 332 569 598 609 762 796 1004 1024-1026 1086 
1561 1848 1928 1959 1976-1977 1983 2138 2161 2166 2257 
2282-2283 2417 2483 2490 2555 2569 2614 2926 3042 31 89 
3191 3272 3300-3303 3426 3494 3503 3548 3574-3576 3605 
3627-3628 3673 3709 3720 3722 3737 3797 3839 3885 4005 
401 1 4019 4055 4133 4192 4246 4269 4282 4340 4354 4365 
4384 4388 4399 4405 4410-4411 4434 4543 4641 4654 4767 
4797-4799 4802 4881-4882 4885-4886 4888 4940 4957 4964 
4972 4985 4998 5002 5017 5030 5076 5103-5104 5132-5133 
5163 5167 5183-5185 5187-5190 5380 5523 5527 5535 5541- 
5542 5544 5674 5684 5693 5724 5766 5778 5788 5794 5796 
5874 5895 5916 5923 5928 5938 6005 6048 6057 6068 6082 
6165 6205 6215 6240-6241 6307 6321-6322 6349-6353 6383 
6458 6595 6598 6606-6607 6765 6799 6805 6815 6860 6871 
6873 6890 6918 6968 6972 6976 6979-6980 6998 7058 7067 
7104 71 13 71 16 7137 7139 7169 7275 7468 7613 7716 7755- 
7757 7759-7760 7797 7799-7801 7930 8077 8084 8093-8095 
8120 8139 8235 8262 8320 8323 8335-8336 8345 8354 8358 
8363-8364 8370 8372 8375 8387 8452 8592 8648 8786 8788 
8792 8813 8863 8898 8944-8955 8965 8996-9001 9051 9076 
9097 9128 9264 9289-9291 9304 9315 9414 9455-9456 9466 
9472-9473 9475 9484 9504 9517 9529 9542 9563 9570 9626 
9703 9780-9781 9843 9927 9939 9961 10000 10027 10154 
10267 10285 10321 10330-10331 10342 10344 10349 10496 
10500 10550 10556-10557 10579 10679 10772 10776 10778 
10788 10795 10802 11013 11132 11136 11146-11149 11406 
11483 11565 11588 11600 1 1606-1 1607 1 1615 11626 11807 
11818 11932 11955 12006 12034 12037 12041 12044 12078 
12092-12094 12123-12126 12150 12213 12375 12381 12387 
12405 12411 12424 12426 12522 12570 12576 12590 12601 
12610 12612 12615 12617-12618 12663 12670 12674 12707 
12729 12739 12749 12753-12754 12777 12830 12842 12913- 
12914 12916-12918 12929 12940 12972 12975-12977 13002 
13024 13047 13051 13054 13062 13064 13082-13084 13090 
13092 13094 13123 13126 13136 13195 13235 13263 13380 
13389 13392 13394 13400 13456 13479 13488-13489 13492 
13494-13499 13502 13506-13507 13514 13516 13546 13555 
13568 13575 13590 13592 13613 13616 13621 13630 13649 
13659-13660 13667-13668 13670-13673 13678-13679 13690- 
13692 13713 13796-13797 13838 13866-1386913872 13882 
13884 13888 13893 


Genomic 
clones from 
the short arm 
of 

chromosome 
8 


Genomic 

DNA 

from 

Genetic 

Research 


EPM001 


150 2023 2327 2490 4109 4783 5503 5560 10267 10760 12017 
12160 12557 12582 12923 13020 13514 


Genomic 
clones from 
the short arm 
of 

chromosome 


Genomic 

DNA 

from 

Genetic 

Research 


EPM003 


5560 12017 12146 
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8 








Genomic 
clones from 
the short arm 
of 

chromosome 
8 


Genomic 

DNA 

from 

Genetic 

Research 


EPM004 


4783 4798 5560 10817 11926 12017 12160 


esophagus 


BioChain 


ESO002 


999-1000 2449 3272 33 15 3548 3550 3634 3697 3796 4011 
4025 4058 4201 4282 5106 5163 5553 6082 6873 7739 9304 
10296 11133 11818 12033 12570 13869 


fetal brain 


Clontech 


FBR001 


51 142 1184 3664 4060 4109 4940 5021 5270 5523 5553 6112 
6805 6908 7294 8558 9457 10376 11059 11985 12006 12122 
12160 12754 13438 13507 13888 


fetal brain 


Clontech 


FBR004 


60 2704 371 1 4025 4109 4783 5001 6082 7597 9010 9504 
9949 11837 12033 12039 12363 12705 12905 13020 13503 
13512 13891 


fetal brain 


Clontech 


FBR006 


6 60 67 598 800 932 1004 1 170 1793-1794 1796-1797 1799- 
1805 1905-1914 1916-1958 1974 1976 1979 1983 2057 2129 
2174 2221 2407 2444 2449 2484-2492 2530-2554 2556-2561 
2563 2576 2857 3064 3207 3479 3556 3673 3709 3722 4060 
4078 4157 4218-4221 4223-4224 4277 4291-4334 4338 4355 
4364 4369 4431 4957 5001 5109 5270 5380 5553 5634 5696- 
5706 571 1 5724 5766 5788 5794 5801 5805-5832 5834-5879 
5882-5901 5936 5990 6057 6723-6732 6765 6770-6791 6797 
6805 6894 7049-7050 7100-7102 7105 7118-7123 7125 7127 
7169 7905 8263-8265 8267-8273 8294 8312-8333 8359 8361 
8375 8452 8633 8664 8740 8757 8884 9010 9111 9432-9436 
9503-9516 9518-9545 9547-9551 9556 9570 9577 9780 9895 
9923-9924 9928 9942 10007 10027 10202 10263-10268 10276 
10284 10298-10310 10329 10331 10496 10542 10595 10621 
10736-10737 10755-10761 10772 10774 10795 11108 11132 
11406 11483 11509-11523 11555-11582 1 1589-11590 1 1600 
11606 11621 11713 11729 11807 11837 12006 12039 12044 
12092 12113 12218 12231 12236 12327 12363 12398 12405 
12465 12511 12524-12530 12576-12577 12579-12601 12729 
12735 12754 12863 12869 12889 12906-12910 12914 12954 
12973 13020-13021 13051-13052 13054 13065 13082-13083 
13427 13445 13455-13470 13488 13490 13496 13498-13501 
13507 13516 13560 13613 13630 13649 13708 13713 13769 
13831 13845-13855 13868 13872 13882 13884 13888-13894 
13896-13900 


fetal brain 


Clontech 


FBRs03 


1005 4405 5111 6337 6964 7742 13084 13864 13891 


fetal brain 


Invitrogen 


FBT002 


51 83 142 321 430 746 932 1054 1058-1065 1493 1833 1947 
2273-2275 2299 2444 2449 2926 3479 3492 3885 4347 4354 
4391 4405 4410 4434 45304804 4985 4998 5075 5160-5169 
5380 5428 5466 5750 5788 5801 5895 6132 6215 6371-6374 
6458 6598 6973-6974 7067 7096 7776-7778 7780-7782 7937 
8143 8323 8361 8364 8372 8377 8452 8633 8977-8984 8986 
9010 9142 9264 9332 9457 9474 9503 951 1 9517 9539 9582 
9827 9848 9927 9950-9953 10027 10161 10329 10430 10492 
10573-10575 11014 11160-11164 11406 11628 11742 11814 
11830 11985 12092 12112-1211412116-12117 12127 12424 
12511 12521 12570 12576 12643 12696 12735 12748-12751 
12754 12830 12835 12913 12957 12977 12990 13002 13020 
13062 13072 13083 13117 13254-13259 13377 13486 13489 
13496 13499 13507 13590 13649 13685-13688 13713 13867 
13888 13891 13893 


fetal heart 


Invitrogen 


FHR001 


1001 1004 2250 4025 6334 6765 7740 8933 8935 9457 9544 
10000 11132 12599 12609 13021 13568 13656 13866 


fetal kidney 


Clontech 


FKD001 


142 346 364 51 1-517 570-572 574 598 685-690 969-970 972- 
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981 983-987 1134 13462123 2167-2169 2238-2246 2342 2444 
2483 25 16 2555 2617 2728 2843 2876 3032 3049 3072 3206 
3208 3282-3283 3525-3531 3533-3543 3548 3591 3709 3722 
3797 3839 3878 401 5 401 9 4043 4246 4277 4367 4405 4696- 
4698 4725 4767 4805-4810 4940 4947 4957 4986 4998 5037 
5056 5073-5080 5082-5091 5099-5100 5108 5258 5504 5523 
5560 5923 6005 6207-6208 6225 6272 6288 6325-6332 6478 
6603 6702 6793 6815 6906 6953-6959 7045 7058 7204 7355 
7426 7449-7450 7520 7543-7546 7561 7587 7718-7732 7930 
8077 8097 8262 8375 8387 8452 8520 8638 8658 8736-8737 
8834 8863 8898 8907-8918 8922 8950 9010 9134 9257 9401 
9457 9544 9597 9760 9781 9791 9828-9830 9912 9914-9918 
10296 10440 10484 10546-10548 10772 11108 11121-11129 
11131-11132 11170 11513 11638 11695 11923-11924 12006 
12033 12062-12070 12072 12160 12405 12522 12570 12594 
12599 12605 12626 12663 12670 12732-12733 12749 12848 
12904 12914 12940-12941 12990 13020 13083 13188 13226- 
13227 13234 13263 13277 13280 13351 13391 13394 13491 
13501 13512 13590 13644 13647-13650 13713 13782 13867- 
13868 13872 13875 


fetal kidney 


Clontech 


FKD002 


3286 5030 5037 5105 11108 12033 12490 12570 13494 13866 


fetal kidney 


Invitrogen 


FKD007 


3272 3806 4025 4253 4277 4654 51 12 5535 5788 5801 8863 
8935 9401 9466 10553 11628 11818 13494 13646 13866 


fetal lung 


Clontech 


FLG001 


79 2367 2395 3010 3460 3885 4828 4948 4962 5001 5723 5748 
5902 5908 6186 6738 7051 7067 7677 7759 9264 9553 9700 
10007 10478 11098 12017 12383 12417 12424 12749 12917 
13020 13169 13472 13554 13644 13782 13835 


fetal lung 


Invitrogen 


FLG003 


142 319 364 629 1671 1806-1814 1816-1819 1877 2129 2161 
2169 2367 2449 2493 2529 3191 3503 3610 4109 4225-4234 
4367 4434 4957 5108 5380 5421 5581 5707-5710 5712 5714 
5788 5801 6057 6733-6741 7034 7103 8274-8278 8365 8597 
8948 9264 9327 9437-9442 9444 9466 9510 9525 9530 9539 
9677 9773 9841 10007 10190 10198 10269-10271 10329 11519 
11524-11527 11927 12531-12539 12848 12890 12904 13021 
13072 13249 13445 13472 13489 13551 13575 13649 13670 
13679 13856-13857 


fetal lung 


Clontech 


FLG004 


1003-1004 2597 5110 6963 9924 10552 11138-11139 12080 
12990 13659 


fetal liver- 
spleen 


Columbia 
University 


FLS001 


-2 4-14 16-22 24 26 28-31 33-46 48-49 51-61 63-68 71-91 93- 
102 104-110 112-124 126-156 158-162 282-283 285-290 292- 
299 301-304 307-3 12 314-326 328-338 340-344 346-353 355- 
365 369 390-400 402 436 441 483 557 567 575-585 595 598 
629 673 678 691-699 701-702 708 731 736 763-767 769-776 
778-786 788-791 793-794 796 925 975 1004 1015 1023 1038 
1068 1104 1134 1144 1184 1192 1216 1264 1298 1346 1482 
1493 1516 1518-1521 1551 1556 1575 1583 1594 1636 1641 
1707 1724 1774 1826-1829 1841 1858 1927 1959 1962-1965 
1967-1972 1974-1979 1981-1998 2000-2009 2011 2045-2051 
2053-2055 2057-2058 2060-2063 2065-2083 2089-2094 2100- 
2101 2161 2170 2174 2184-2194 2215 2222 2269 2290 2310 
2342 2409 241 1 2414 2444 2449 2458 2483 2490 2497-2498 
2510 2516 2523 2529 2555 2562 2566-2576 2578-2586 2588- 
2591 2593-2601 2604 2607-2608 2611-2612 2614-2618 2620 
2622-2642 2644 2646-2653 2655-2664 2666 2668 2670-2680 
2696 2750 2788-2793 2795-281 1 2814-2826 2828-2835 2837- 
2842 2844 2846-2848 2851-2858 2860 2862-2871 2876 2878 
2893 2900-2905 2907-2909 2926 2929 2949 2952-2953 2959- 
2960 2984 2992 3032 3058 3069 3073-30763078-3080 3082 
3093 3166 3194 3196 3207 3210-3211 3213-3214 3217-3225 
3249 3257 3272 3282 3286-3287 3304-3307 3310-331 1 3314- 
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3317 3319 3321-3332 3334-3346 3356 3419 3426 3434 3446 
3450 3455 3468 3470 3479 3484 3494 3503 3535 3537 3548 
3552 3557 3572 3590 3594 3597 3604-3605 3610 3612 3625 
3627 3634 3668 3670 3709 371 1 3720 3722 3729 3737 3777 
3797 3806 3808 3810 3813 3839 3885 3926 3990 3996-3997 
4006 4009 401 1 4019-4022 4025 4040 4043 4060 4078 4095 
4109 4129 4192 4194 4201 4245-4246 4253 4261 4277 4282 
4297 4335-4338 4340-4357 4359-4361 4364 4366-4367 4370- 
4372 4374-4377 4379-4382 4384-4386 4388-4392 4395-4396 
4398-4414 4417-4424 4426-4455 4457-4458 4460-4465 4542- 
4543 4545 4547-4553 4555-4562 4565-4575 4577 4579 4581- 
4582 4585-4588 4593 4596 4607 4616-4629 4644-4645 4647 
4654 4671 4676 4687 4689 4694 4721 4726-4729 4759 4767 
4775 4783-4784 4788 4790-4791 481 1-4819 4830 4837 4845 
4862 4874 4889-4902 4930 4940 4948-4949 4957 4962-4965 
4972 4985 4998 5022 5029-5030 5037 5040 5075-5076 5092 
5108 5132 5152 5163 5167 5171-5172 5335 5380 5398 5473 
5503 5514 5516 5523 5526-5527 5535-5536 5553 5581 5598 
5604 5616 5674 5684 5691 5693 571 1 5715 5724 5726 5748 
5750 5757 5778 5788 5794 5801 5817 5865 5874 5894-5895 
5902-5914 5916-5919 5921-5928 5930-5949 5951-5954 5956- 
5964 5966 5979 6005 6010 6013-6015 6017-6018 6020-6022 
6024-6025 6028-6032 6034-6048 6052-6062 6064 6068 6073- 
6081 6086 6092-6094 6104 61 12 6125 6135 6140 6147 6149 
6153-6157 6165 6186 6195 6209-6211 6240-6241 6243-6248 
6250-6253 6272 6276 6287-6288 6307 6313 6338 6371 6374 
6419 6430 6446 6451 6458 6478 6496 6508 6545 6579 6595 
6598 6607 661 1 6624 6629 6642 6658 6695 6726 6728 6730 
6738 6745 6754 6765 6767 6788 6793-6797 6799-6808 6810- 
6815 6833-6843 6845-6847 6850-6856 6858 6860 6862-6865 
6870-6871 6875 6877 6891 6893-6894 6900 6907-6908 6917 
6919-6925 6950 6979 7049 7058 7067 7077 7096 7109 71 16 
7118 7128-7131 7133-7135 7137-7141 7144-7147 7149-7150 
7152-7153 7155-7179 7181-7192 7194-7198 7265-7271 7273- 
7280 7282-7305 7307-7312 7316 7327-7336 7375-7376 7383 
7386 7451 7455 7468 7473 7475-7477 7479 7484 7548-7553 
7555-7558 7561 7608 7617-7618 7620-7631 7642 7675 7687 
7695 7716 7768 7809 781 1 7839 7842 7864 7883 7933 7977 
8069 8093 8095 8105 8110 8116 8139 8224 8226 8235 8241 
8262 8323 8334-8340 8342-8350 8352-8355 8357-8359 8361- 
8408 8410-8411 8414-8419 8452 8456 8490-8498 8500-8502 
8504-8526 8529-8530 8537 8540-8541 8553-8560 8563 8566 
8568 8577 8592 861 1 8639 8648 8659-8661 8664 8668-8669 
8710 8738-8739 8741 8777 8792-8804 8830 8834 8839 8853 
8858 8863 8865 8876 8898 8926 8935 8939 8950-8951 8957 
8963 9010 9012 9076 9092 9097 91 12 91 19 9128 9257 9264 
9266 9284 9291 9301 9304 9313 9318 9352 9375 9399 9414 
9424 9445 9455 9457 9466 9473-9474 9476 9480 9484-9485 
951 1 95 17 9528-9529 9536 9539 9542 9544 9552-9557 9559 
9561-9564 9566-9572 9574-9577 9579-9630 9646 9655 9666- 
9667 9669-9671 9673 9675-9697 9700-9701 9703-9704 9713- 
9715 9718-9719 9722 9726-9729 9734 9752 9758 9761 9777 
9780-9784 9813 9831 9854-9867 9869 9906 9927 9929 9939 
9942-9943 9969 9983 10000 10007 10205 10267 10284 10296 
10305 10311 10313-10316 10318-10325 10327 10329-10331 
10333-10354 10356-10363 10395-10397 10400-10403 10405- 
10410 10413 10417-10422 10427 10446-10447 10463 10470 
10485 10496 10502-10507 10515 10542 10557 10565-10566 
10645 10679 10688 10691 10729 10750 10760 10762 10764- 
10769 10772-10774 10776-10782 10784 10786-10795 10797- 
10815 10817-10830 10859 10861-10869 10871-10879 10881- 
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10893 10908-10916 10929 10934 10943 10948 10968 10971 
10975 10977-10983 11005 11009 11011 1 1015-11018 1 1028 
11031 11054-11063 11094 11108 11114 11132 11136 11142 
11194 11208 11213 11246 11390-11391 11401 11406 11411 
11414 11459 11462 11483 11508 11513 11521 11534 11559- 
11560 11565 11573 11585-11603 11606-11610 11612-11616 
11618-11638 11640 11642-11657 11659-11667 11669-11670 
11672 11699 11716-11718 11720-1 1722 1 1724-11741 11743- 
11758 11760-11761 1 1775-1 1787 1 1797 11799-11801 11818 
11830 11862 11864-11866 11869 11876 11889 11911 11925- 
11928 11958-11961 11963-1197611979-11980 12000 12006 
12017 12019 12028 12033-12034 12041 12078 12086 12092 
12122 12127 12159-12160 12178 12216 12218 12236 12245 | 
12256 12260 12289 12303 12305 12308 12327 12330 12363 
12375 12382 12387 12402-12405 12424 12426 12439 12442 
12465 12470 12476 12511 12522-12523 12546 12551 12560 
12570 12576 12590 12601-12602 12604-12619 12621-12633 
12636 12639 12646-12653 12670-12671 12674 12682-12683 
12688 12693 12696 12706 12709-12715 12735 12749-12750 
12772 12790 12823 12830-12832 12835 12842 12849 12855 
12863 12904-12905 12910-12911 12913-12916 12918-12924 
12929 12931-12932 12934-12937 12939-12940 12954-12955 
12965 12973 12976-12977 12989-12990 12993 12999 13020- 
13021 13035 13051 13054-13055 13060 13062 13066-13067 
13069-13087 13089-13095 13097-13102 13114-13117 13119- 
13120 13122-13126 13134-13136 13141 13147 13156 13174 
13193 13195-13196 13198-13201 13254 13260 13264 13277 
13280 13295 13351 13368 13377 13391 13394 13400 13409 
13412 13420 13456 13472-13477 13479-13484 13486-13492 
13494-13496 13498-13499 13501-13510 13512-13516 13518- 
13527 13533 13541-13544 13546-13551 13553-13555 13560 
13575 13587-13589 13597 13603-13604 13613 13616 13622- 
13623 13630-13631 13644-13646 13649 13659 13670 13679 
13713 13743 13748 13769 13775 13782 13793 13803 13808 
13818-13819 13858-13860 13864 13866-13869 13872 13882 
13884 13888-13889 13891 13893 13901 


fetal liver- 
spleen 


Columbia 
University 


FLS002 


6 16 24 30-31 63 67 81 83 89 95 103 115 117 126 140 142 147 
150-151 158 162 211 225 287 308 332 356 358 390-391 438- 
439 483 551 556 641 654 694 701 708 731 788 997 1006 1012 
1047 1082 1151 1154 1178 1184 1208 1212 1356 1480 1507 
1551 1556 1623-1647 1649-1667 1669-1671 1675 1677-1698 
1828-1829 1858 1877 1885 1889 1914 1927 1947 1961 1997 
1999 2028 2057 2070 2092 2098 2138-2139 2154 2161 2174- 
2175 2189 2191 2257 2282 2327 2342 2363 2399 2403 2409 
2440-2441 2443-2450 2452-2459 2490 2498 2522-2523 2544 
2555 2568 2575-2576 2592 2597 2605 2619 2623 2659 2806 
2809 28 12 2860 2869 2879 2903 2926 2932 2988 303 1 3066 
3075 3089 3188-3189 3286-3287 3319 3343-3344 3356 3426 
3535 3548 3552 3554-3555 3583 3591 3610 3625 3634 3673 
3709 3711 3720 3722 3839 3899 3926 3985 4005-4006 4011 
4020 4025 4055 4058 4060 4078 4091-41 16 41 1 8-4126 4128- 
4142 4172 4194 4201 4253 4277 4282 4347 4350 4353-4355 
4362 4367 4374 4379 4386 4391 4394 4396 4402 4426 4431 
4435 4437-4439 4512 4553 4578 4586 4607 4622 4644 4654 
4671 4758 4767 4783 4798 4836 4845 4899 4940 4948 4962- 
4963 4985 4991 4998 5001 5037 5108 5167 5171 5177 5198 
5237 5293-5294 5380 5400 5523 5535-5536 5581-5591 5593- 
5599 5601-5613 5615-5624 5627-5637 5653 5674 5691 5693 
571 1 5724 5726 5733 5748 5757 5772 5778 5794 5817 5874 
5894 5902 5904 5906-5907 5909 5911 5916-5919 5923-5924 
5927 5929 5932 5938 5941 5948 5957 5959-5960 5962 5964 
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5979 6005 6020 6027 6037-6038 6052 6054 6057 6060-6062 
6064 6068 61 12 6140 6195 6225 6246 6287-6288 6306-6307 
.6336 6354 6371 6374 6423 6430 6451 6458 6461 6469 6478 
6496 6508 6538 6545 6563 6580 6583 6595 6598 6607 6629 
663 8 6641-6643 6645-6675 6709 6726 673 0 6765-6766 6788 
6792-6793 6795 6797-6798 6800-6801 6808 6812 6836 6843 
6850-6851 6855 6860 6864 6870 6875 6888 6908 6924 6950 
6958 6968 6998 7003 7018 7045 7049 7056 7071-7079 7095- 
7096 7109 7116 7118 7126 7133 7135 7137-7139 7155-7156 
7166-7167 7169 7175 7178 7275 7291 7294 7329 7422 7426 
7478 7608 7627 7687 7695 7716 7815 7839 7871 7893 7914 
7937 7977 8022 8095 8120 8133-8134 8148-8149 8151-8167 
8169-8182 8184 8186-8192 8194-8196 8241 8246 8276 8289 
8298 8336 8339 8343 8345 8349 8355-8356 8361 8363 8365 
8367-8368 8370 8373 8375-8378 8385 8388 8417 8496 8518 
8520 8543 8558 8561 8563 8646 8667 8710 8738-8740 8786 
8803 8813 8865 8926 8946 8948 8963 8970 9010 9049 9119 
9128 9142 9164 9222 9264 9289 9296 9301 9317-9318 9320 
9322 9335-9341 9343-9362 9365-9368 9370 9372-9373 9375- 
9377 9399 9455 9457 9466 9472 9475 9480 9483 9495 9526 
9533 9536 9553 9556 9558 9560 9563 9567 9570 9582 9597 
9601 9630 9646 9655 9671 9683 9695 9700 9703 9715 9722 
9729 9733 9752 9758 9783 9843 9848 9855 9880 9936 9942 
9983 10007 10027 10103 10142 10186-10217 10274 10284 
10287 10296 10313-10314 10329 10331 10349 10352 10354 
10400 10405 10430 10439 10496 10500 10507 10542 10621 
10709-10710 10712-10717 10750 10760 10776-10779 10782 
10794 10809 10816 10837 10885 10891 10895 10913 10960 
10971 10975 11057 11071 11098 11142-11143 11148 11194 
1 1246 1 1401 1 1406 1 1430-1 1453 1 1455-1 1464 1 1483 1 1504 
11508 11556 11560 11589 11596 11615 11618 11626 11631 
11714 11729 11830 11833 11865 11988 12000 12006 12017 
12019-12020 12041 12044 12081 12092 12178 12195 12245 
12256 12277 12297 12327 12363 12383 12402-12403 12405 
12426-12427 12444-12455 12457-12465 12467-12473 12475- 
12476 12520 12546 12576 12599 12607 12611 12614-12615 
12617-12618 12622 12627 12630-12632 12636 12647-12648 
12650 12685 12688 12691 12693 12696 12703 12739-12740 
12743 12763 12808 12830 12834 12842 12849-12858 12860- 
12866 12885 12895 12906 12910 12913 12915 12917 12920 
12922 12929 12933 12940-12941 12954-12955 12957 12965 
12977 12990 12993 12999 13021 13039-13040 13047 13051 
13054-13055 13060 13062 13072-13073 13077 13082-13083 
13085 13092-13095 13098 13117 13123 13136 13173 13195- 
13197 13202 13249 13254 13362 13377 13391 13394 13406- 
13409 13411-13415 13417-13420 13456 13473-13474 13477 
13480 13484 13486 13488-13489 13496-13498 13500 13502 
13504 13506-13507 13511-13512 13515 13521 13546 13551 
13554-13555 13560 13575 13590 13597 13613 13616-13617 
13622 13630 13659-13660 13670 13678-13679 13695-13696 
13701 13708 13713 13726 13748 13775 13795-13796 13815- 
13826 13835 13838 13859-13860 13864 13867 13869 13872 
13882-13885 13888-13889 13891 13893 


fetal liver- 
spleen 


Columbia 
University 


FLS003 


525 2269 2529 2627 2636 3552 3554-3555 3597 4201 4246 
4253 4422 4426 4434 4438 5536 5801 8935 9536 11136 11142 
11519 11626 11927 12028 12077-12079 12086 12305 12403 
12424 13021 13472 


fetal liver 


Invitrogen 


FLV001 


40 51 60 598 731 1023 1048-1049 1820 1976 21 11 2268-2269 
2449 2483 2529 2568 2576 2614 2809 3207 3220 3468 3595- 
3599 3885 4020 4346 4390 4422 4434 4543 4615 4767 4833 
5092 5152-5155 5270 5505 5526 5553 5581 5715 5724 5726 
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5750 5766 5788 5794 5801 5902 6125 6147 6458 6598 6624 
6645 6765 6799 6805 6862 7104 7856 8070 8156 8359 8361 
8432 8740 8928 8935 8972-8974 9012 9041 9128 9264 9445 
9474 9485 951 1 9536 9593 9597 9703 9780 9927 9943 9945- 
9946 9948 10007 10137 10275 10334 10529 10557 10566- 
10569 10776 10828 1 1 142 1 1560 1 1587 1 1618 1 1927 12070 
12086 12105-12108 12127 12218 12236 12327 12424 12511 
12541-12542 12605 12746 12916 12977 12985-12986 13020- 
13021 13060 13064 13135 13246-13247 13472 13479 13488- 
13489 13497 13533 13554 13590 13659 13680-13682 13859- 
13860 13882 13888-13889 


fetal liver 


Clontech 


FLV002 


360 996 1134 3226 5102 9401 


fetal liver 


Clontech 


FLV004 


998 1927 2449 2627 4025 4043 4426 4438 4834 5030 5726 
6730 9474 9942 10769 1 1 142 1 1587 12590 12608 13889 


fetal muscle 


Invitrogen 


FMS001 


40 150 731 894 1264 1555 1613 1821-1822 1883 1959 2161 
2343 2494-2496 2555 3203 4235-4237 4434 4671 4694 4783 
4885 4940 4985 5037 5045 5481 5716-5718 5724 5788 5902 
5932 6624 6742-6743 6825 7054 7104 7141 7632 7800 8279- 
8281 8863 8935 9204 9264 9446-9449 9451 9474 951 1 9536 
9556 9722 9780-9781 10097 10273-10274 10330 10738-10739 
10750 11098 11406 1 1528-11530 1 1560 12305 12544-12546 
12590 12609 12618 12663 12670 12702 12735 12891-12892 
12916 13051 13053 13195 13441-13442 13479 13613 13782 


fetal muscle 


Invitrogen 


FMS002 


3378 4348 4434 8932 11132 12033 12570 13782 


fetal skin 


Invitrogen 


FSK001 


60 142 235 319 641 683 800 1015 1050 1346 1774 1823-1825 
2044 2099 2 1 1 1 2 1 6 1 22 1 5 2223 2280 23 67 240 1 2408 25 1 3 
2516 2614 2812 2871 2926 3207 3356 3468 3490 3503 3548 
3599 3720 3722 3885 4020 4032 4060 4109 4238 4240-4244 
4246 4253 4277 4340 4353 4355 4388-4389 4391 4405 4434 
4543 4562 4568 4654 4667 4671 4767 4940 4944 4947 5030 
5037 5075 5163 5198 5296 5380 5481 5514 5581 5656 5674 
5691 5719-5724 5748 5757 5801 5894 5902 5923 6057 6076 
6078 6125 6147 6215 6336 6374 6409 6437-6438 6551 6563 
6744 6783 6803 6805 6815 6873 6900 6908 6917 6985 6987 
7037 7054 7058 7067 7075 7275 7355 743 1 7484 7507 7587 
7627 7642 7653 7687 7871 7977 8110 8134 8186 8226 8235 
8251 8262 8282-8287 8298 8356 8365 8368 8452 8455-8456 
8539 8597 8648 8679 8813 8863 8898 8921 8935 8939 8948 
8951 8957 8983 9002 9041 9061 9122 9128 9257 9285 9414 
9452-9453 9466 9474 9484 9503 9517 9529 9553 9563 9671 
9677 9703 9714 9781 9841 9927 10007 10175 10296 10343 
10491 10741-10742 10778 10795 10907 10948 11239 11513 
11531 11573 11606 11626 11807 11869 11952 12000 12006 
12160 12190 12218 12256 12327 12363 12371 12375 12378 
12426 12438 12465 12470 12488 12549-12550 12570 12604 
12615 12617 12663 12670 12735 12749 12754 12830 12893 
12904 12910 12916-12917 12976-12977 13038 13084 13090 
13116 13249 13254 13367 13389 13391 13443-13445 13456 
13472 13479-13480 13494 13496 13499 13505 13512 13516 
13551 13554-13555 13575 13590 13613 13630-13631 13644 
13670 13713 13782 13784 13793 13803 13858 13866 13869 
13882 13891 


fetal skin 


Invitrogen 


FSK002 


1004 3544 4834 5523 9922 9942 11134-11135 12570 13495 
13499 13793 13884 13889 


fetal spleen 


BioChain 


FSP001 


997 5030 9466 11108 12033 12749 13590 


umbilical 
cord 


BioChain 


FUC001 


60 83 89 142 166 567 609 760 997 1302-1304 1306-1307 1309 
1671 1697 1724 1848 1917 1978 2111 2154 2161 2207 2215 
2315 2343 2444 2569 2576 2591 2597 2652 2866 2926 3468 
3526 3599 3602 3625 3697 3722 3799-3805 3813 3839 3885 
4025 4060 4173 4197 4246 4277 4340 4364 4387 4391 4395- 
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4396 4405 4410 4434 4543 4589 4667 4694 4767 4834 4865 
4940 4957 4963 4985 5030 5093 5096 5108 5163 5171 5213 
5313 5340 5342-5344 5380 5481 5521 5526 5627 5656 5674 
5714 5743 5750 5757 5801 5895 5902 5912 5938 6027 6061 
6082 6125 6186 6195 6218 6241 6313 6321 64106458 6486- 
6495 6595 6607-6608 6668 6745 6825 6860 6870 6979 7054 
7141 7260-7261 7275 7329 7355 7373 7383 7444 7579 7586 
7677 7716 7807 7930 7935-7936 7995 8077 8093 8226 8246 
8262 8298 8343 8345 8358 8387 8452 8535 8563 8635 8669 
8711 8819 8858 8863 8921 8935 8943-8944 8948 8963 9001- 
9002 9012 9097 9130 9132 9134-9138 9264 9303-9304 93 13 
9332 9401 9424 9466 9472 9474 9544 9597 9665 9677 9700 
9722 9734 9758 9834 9841 9848 10050 10175 10183 10267 
10277 10311 10337 10446 10470 10508 10529 10566 10691 
10772 10828 10961 11108 11132 1 1145 1 1239 1 1241 11273- 
11276 11293 11408 11484 11513 11596 11604 11607 11620 
11695 11823 11909 11927 11942 12000 12077 12160 12245 
12255-12256 12292 12305 12308 12363 12383 12404-12405 
12426 12438 12511 12570 12601 12605 12630 12700 12706 
12754 12875 12913 12916 12935 12957 12993 13002 13009 
13020-13021 13062 13135-1313613141 1325413283 13318- 
13319 13336 13394 13400 13472 13479 13481 13488-13489 
13495-13496 13499 13507 13511-13512 13516 13551 13555 
13568 13575 13590 13613 13630 13644 13713 13740-13742 
13866-13867 13882 13884 13888 


fetal brain 


GIBCO 


HFB001 


51 70 211-213 215-222 224-229 445 586-588 598 608 703-710 
712-716 1068 1148 1178 1184 1193 1308 1516 1556 1774 1778 
1957 1974 1976-1977 2010 2025-2029 2134-2135 2145 2161 
2171 2174 2220 2223 2257 2266 2283 2444 2449 2510 2555 
2563 2569 2590 2597 2614 2617 2717 2719-2723 2726 2729- 
2734 2809 2997 3002 3081 3083 3086 3109 3226-3237 3239 
3257 3272 3278 3282-3283 3286 3356 3419 3460 3479 3492 
3495 3544 3548 3557 3590 3604-3605 3625 3627 3634 3673 
3689 3697 3709 3722 3797 3810 3839 3885 4006401 1 4019 
4022 4025 4040 4054 4059 4095 4194 4201 4253 4277 4282 
4355 4364 4383 4399 4405 4410 4412 4434 4473 4480 4498- 
4504 4507-4508 4517 4543 4654 4689 4694 4730-4733 4735 
4755 4783 4820-4823 4825-4830 4845 4885 4940 4949 4957 
4962 4972 4985 5017 5019 5022 5031 5037 5108 5132 5167 
5171-5172 5262 5380 5481 5521 5523 5527 5553 5616 5656 
571 1 5740 5788 5801 5894 5902 5906 5908 5923 5938 5979- 
5989 6057 6082 6096 6125 6147 6159 6161-6163 6165 6207 
6212-6213 6215-6222 6241 6257 6288 6336 6369 6374 6419 
6422 65 12 6551 6555 6595 6598 6607 671 1 6767 6788 6823- 
6825 6860 6871 6879 6892-6894 6900 6909-6910 7058 71 18- 
71 19 7169 7220 7222 7224-7225 7228-7229 7231-7233 7275 
7426 7431 7444 7481-7483 7485-7488 7560-7567 7569 7608 
7743 7768 7792 7814 7845 7864 7930 7977 7995 8093 8095 
8110 8140 8226 8235 8262 8345 8358 8361 8370 8372 8387 
8441-8443 8445 8447 8450-8455 8520 8535 8558 8597 8603 
8648 8663-8666 8677 8702 8742-8749 8811 8838-8839 8943 
8951 9010 9092 9134 9137 9257 9287 9304 9317 9455 9457 
9466 9472 9475 9484 9510 9515 9533 9553 9567 9644-9645 
9647-9649 9734 9758 9781 9785-9786 9791 9832-9837 9927- 
9928 9939 9942 9970 10053 10175 10275 10277 10296 10329 
10375-10378 10434 10464 10486 10496 10645 10679 10691 
10778 10782 10791 10838-10844 10928 11019-11025 11027 
11032 11055 11061 11108 11132 11145 11153 11208 11239 
11343 11483 11513 11588 11596 11604 11606 11620-11621 
11668 11691-11695 11818 11867 11869 11929-11930 12006 
12033 12039 12041 12044 12047 12066 12078 12197 12218 
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12327 12375 12405 12411 12424 12438 12521-12522 12564 
12570 12576 12582 12590 12599 12601 12604 12611 12626 
12630 12637-12640 12670 12674 12688 12694-12695 12703 
12734 12748 12754 12785 12835 12840 12913 12916-12917 
12929-12930 12935 12940 12946 12959 12989 12993 13020 
13051-13052 13054 13073 13083 13090 13108-13110 131 17 
13131 13136 13148 13158 13160 13169 13184-13186 13254 
13263 13277 13295 13389 13394 13438 13456 13477 13479- 
13481 13489-1349013494-13500 13502-13503 13506 13512 
13516 13535-13536 13555 13575 13590 13592 13605-13606 
13613 13616 13627 13630 13644-13645 13660 13670 13679 
13687 13713 13793 13803 13838 13861 13866 13868-13869 
13875 13888 13891 13896 


macrophage 


Invitrogen 


HMP001 


1002 2253 3548 4011 4058 4201 4246 4282 5526 8093 8262 
11137 12039 12426 12511 12521 13888 


infant brain 


Columbia 
University 


IB2002 


6 89 142 21 1 276 307 518 589-590 598 644 717-718 720-721 
773 841 921-924 926-932 937 1023 1091 1147-1150 1289 1493 
1522-1525 1557-1564 1724 1778 1928 1947 1976 1978 2097 
2111 2138 2172 2218-2223 2257 2283 2401 2418-2419 2444 
2449 25 1 6 2555 2563 2569 2575 2597 2627 2809 281 8 2880 
2932 3087 3166 3226 3241-3245 3272 3435 3474 3476-3477 
3481 3483-3486 3488-3489 3492 3494-3495 3497 3538 3544 
3548 3572 3595 3597 3669 3671-3672 3709 3720 3722 3737 
3797 3817 3885 3985 3997-3999 4006 401 1 4022-4031 4040 
4095 4201 4246 4253 4277 4297 4355 4405 4410 4434 4676 
4689 4767 4775 4831-4834 4837 4885 4962 4983 4998 5017- 
5019 5021-503 1 5035-5037 5040 5095 5143 5164 5167 5169 
5233-5236 5380 5505 5517-5519 5523 5536 5546-5548 5691 
5695 5724 5750 5766 5788 5801 5S95 6027 6048 6082 6095 
6111 6119 6132 6135 6147 6223 6301-6303 6305-6307 6374 
641 1 6459 6555 6563 6591-6595 6601 6606-6607 6609-661 1 
6613-6614 6624 6711 6767 6799 6805 6909 6946 7018 7052 
7067 7118 7133 7166 7168-7169 7386 7464 7469 7473 7571- 
7573 7697-7698 7716 7814 7827-7828 7905 7934 7977 8071- 
8073 8084 8098-8104 8235 8320 8331 8351 8354-8356 8364- 
8365 8370 8372 8376 8452 8520 8572 8667-8668 8750-8752 
8813 8830 8863 8883-8885 8950 8958 8963 9040-9041 9251 
9257 9267 9293-9298 9313 9424 9454 9456-9457 9466 9510 
9514-9515 9533 9542 9556 9576-9577 9597 9626 9646 9722 
9758 9787 9903-9904 9988 10027 10138-10139 10155-10160 
10284 10296 10319 10346 10496 10536-10542 10557 10670- 
10672 10680 10682 10778 10791 10897 10971 11028-11030 
11052 11100-1110911122 11132 11192 11392 11403-11404 
11447 11513 11568 11595 1 1606 1 1626 1 1638 1 1818 11927 
11931-11933 12028 12030-12031 12033-12034 12036-12039 
12041 12047 12049 12146 12155-12157 12160 12327 12344 
12363 12383-12385 12402 12405-12408 12424 12439 12521- 
12522 12570 12590 12601 12611 12615-12616 12626 12639 
12684-12685 12688 12696-12697 12707 12725-12727 12729 
12748-12749 12754 12763-12764 12830 12904 12913 12916- 
12917 12923 12929 12959 12977 12990 12996 13000 13047 
13051 13054 13062 13073 13082-13084 13117 13141 13187- 
13188 13196 13218-13219 13249 13257 13277 13284-13285 
13351 13389 13394-13395 13456 13458 13479-13481 13488 
13494-13495 13499 13503 13512 13516 13530 13535 13575 
13607-13608 13613 13616 13641 13649 13659 13679 13705- 
13706 13708 13713 13793 13798-13801 13803 13861 13869 
13872 13875 13883-13884 13888 13891-13893 13896 


infant brain 


Columbia 
University 


IB2003 


6 46 746 1914 1947 1959 1994 2111 2220 2257 2516 2962 
3166 3226 3272 3435 3572 3885 4277 4377 4410 4833 4837 
5029-5030 5040 5147 5259 5851 6147 6423 6595 6598 6611 
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SEQIDNOS: 




• 




7145 7169 7716 8262 8354-8355 8364 8452 8863 9264 9457 
9515 9556 9577 9626 9646 9715 9781 9928 9970 10027 10329 
10376 10557 10791 11083 11289 11406 11459 11513 11663 
11933 12039 12405 12546 12570 12601 12616 12685 12754 
12913 12916 12954 12977 12990 13000 13056 13062 13195 
13257 13392 13456 13458 13480-13481 13488 13497 13499 
13506-13507 13511-13512 13514 13516 13549 13616-13617 
13649 13793 13861 13884 13888 13891 13896 


infant brain 


Columbia 
University 


IBM002 


1564-1565 1976 2218 2420 3166 4391 5788 6147 6302 10329 
11108 11513 12039 12729 12830 13062 13257 13512 13617 
13803 13859 13861 


infant brain 


Columbia 
University 


IBS001 


927 931 1564 2915 3166 3737 3885 5029 5044 5095 5143 5701 
5894-5895 6135 6307 7018 8452 8963 9424 9556 9626 10296 
10542 11083 11100 11108 11167 12383 12611 12785 13257 
13280 13479 13495 13506 13617 13679 13713 13896 


lung, 
fibroblast 


Strategene 


LFB001 


67 235 552 641 746 1346 2090 21 86 2343 2770 2812 2871 
3203 3537 3548 3634 3812 3839 4073 4253 4434 4654 4783 
4786 4821 4940 4957 4963 4985 5163 5172 5492 5523 5788 
591 1 5938 6027 6048 6057 6223 6272 6322 6599 6873 68S6 
6997 7204 7759 7815 7933 7977 8226 8235 8262 8354 8452 
8648 8742 8791 8830 8863 8898 8944 8948 8951 9000 9076 
9097 9128 9290 9304 9414 9455 9466 9472 9476 9503 9544 
9563 9671 10000 10050 10342 10376 10434 10447 10679 
10760 10857 11055 11094 11289 1 1402 1 1607 1 1818 11837 
12000 12006 12178 12363 12405 12462 12582 12617 12636 
12670 12754 12913 12916-12917 12925 12940 13126 13377 
13394 13446 13456 13494 13497 13499 13502 13575 13613 
13670 13679 13691 13795 13797 13867-13868 13882 


lung tumor 


Invitrogen 


LGT002 


51 69 88 142 158 211 444 598 608 722-723 733 795-797 799 
1004 1023 1135-1145 1308 1338 1346 1493 1526-1533 1535 
1566-1569 1626 1654 1816 1 841 1848 1860 1976 1 983 2090 
2150 2161 2173 2223 22572266 2296-2297 2342 2402 2449 
2483 2555 2569 2576 2591 261 1 2623 2724 2809 2868 2880 
2926 3088 3166 3272 3347 3571 3658-3666 3673 3722 3737 
3797 3839 3885 4000-4002 4006 401 1 4025 4032-4034 4060 
4133 4201 4246 4253 4277 4282 4340-4341 4384 4388 4391 
4399 4402 4405 4434 4543 4622 4671 4686 4767 4783 4791 
4833-4834 4836 4885 4903 4940 4957 4987 4998 5001 5017- 
5018 5030 5037 5052 5108 5163 5210 5227-5232 5422 5451 
5520-5528 5581 5627 5637 5674 5693 5713 5724 5748 5766 
5772 5788 5894-5895 5902 5907 5917 5938 5990 6005 6020 
6094 6129 6147 6171 6224-6225 6239 6241 6253 6322 6336 
6404-6410 6461 6508 6595-6598 6624 6765 6793 6798 6851 
6870 6873 6994 7003 7008 7045-7047 7053-7055 7067 7085 
7109 7116 7139 7141 7158 7169 7187 7468 7517 7524 7579 
7608 7676 7687 7716 7761 7825-7826 8074-8078 8105-8106 . 
8139 8235 8298 8323 8335 8345 8354 8359 8363 8365 8370 
8372 8375 8456 8563 8633 8648 8678 8741 8792 8798 8805 
8831 8863 8870 8989 9002 9010 9012 9032-9037 9039 9051 
9076 9080 9092 9128 9257 9269-9271 9273-9274 9276 9299- 
9301 9399 9414 9424 9457 9466 9472 9480 9484-9485 9517 
9533 9536 9539 9544 9567 9612 9626 9665 9677 9689 9700 
9703 9758 9773 9784 9788 9838-9841 9870 9928-9929 9942 
9970 9983-9986 10140-10143 10149 10161 10175 10275 10296 
10319 10330 10346 10349 10386 10496 10508 10561 10591- 
10593 10673-10674 10742 10774 10778 10782 10788 10802 
10836 10971 11031-11033 11064 11108 11132 11182-11191 
11393-11395 11406 11459 11462 11560 11565 11588 11596 
11604 11606-11607 11615 11618 11629 11821 11823 11845 
11869 11932 11934 11961 1 1978-11980 12000 12006 12017 
12022 12033 12078 12080 12127 12150-1215412178 12236 
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12327 12363 12371 12379 12386-12393 12409 12424 12521 
12570 12590 12616-12618 12630 12637 12674 12691 12693 
12698-12699 12716 12749 12754 12760 12762 12830 12832- 
12833 12842 12904 12907 12913 12917 12923 12993 13020 
13031-13032 13035 13060 13062 13064 13072 13083 13090 
13136 13150 13193 13197 13202 13234 13277 13279 13281- 
13282 13295-13296 13381-13383 13385-13386 13394 13400 
13456 13477 13479-13480 13484 13488-13489 13492 13494 
13496-13497 13499 13503 13506-13507 13511 13554 13575 
13597 13616 13630 13646 13649 13659-13660 13670 13701- 
13704 13713 13743 13782 13788-13789 13791-13792 13802- 
13803 13864 13867-13868 13872 13882-13884 13888 13892 


lymphocytes 


ATCC 


LPC001 


142 316 1034 19772161 2342 2367 2483 2513 25292932 2962 
3272 371 1 3722 3839 4006 4277 4282 4405 4434 4836 4970 
5037 5239 5451 5706 5796 5924 5941 6048 6147 6301 6765 
6793 6S05 6815 7133 7171 7275 7320 7687 7815 8033 8093 
8095 8105 8110 8246 8262 8345 8365 8368 8370 8535 8664 
8674 8722 8834 8858 8898 8919 8963 9012 9257 9456-9457 
9484 9539 9544 9560 9758 9928 9939 10175 10201 10284 
10333 10496 10520 10529 10679 10744 10766 11032 11108 
11508 11513 11519 11853 11862 12033 12160 12363 12387 
12402 12424 12438 12520 12560 12570 12663 12830 12913 
12923 12968 13021 13066 13072 13085 13140 13147-13148 
13394 13479 13488 13495 13499 13507 13575 13867 13869 
13888 


leukocyte 


GIBCO 


LUC001 


21 49 51 67 83 88 94 142 21 1 316 326 340 368 371 403 438 
443-455 464 485 518-523 525-530 532-535 591-592 616 724- 
726 744 800 841 889 943 1004 1346 1556 1570-1572 1826 
1914 1916 1927 1959 1976-1977 2095 2103-2107 2124-2127 
2134 2138 2145 2161 2163 2175 2215 2223 2237 2266 2343 
2363 2412 2421 2449 2483 2529 2555 2569 2575 2597 2617- 
2618 2695-2696 2827 2876-2877 2899 2903 2961 2965-2975 
3002 3035-3038 3040-3046 3 109 3132 3166 3199 3203 3207 
3246-3251 3265 3272 3286-3287 3312 3344 3356 3361 3378 
3400 3434 3451 3468 3479 3484 3494 3503 3552 3590 3610 
3627 3634 3704 3709 3720 3722 3725 3757 3797 3804 3810 
3839 38S5 3985 3996 4006 4009-401 1 4019 4022-4023 4025 
4035-4037 4058 4060 4194 4253 4269 4277 4297 4341 4362 
4388 4390-4391 4396 4399-44004402 4405 4410 4412 4431 
4434 4534 4543 4594 4615 4630 4641 4651-4663 4670 4688- 
4689 4694 4699-4706 4736 4755 4758-4759 4767 4775 4783 
4791 4798 4828 4835-4840 4845 4865 4930 4940 4947-4948 
4957 4962 4964 4972 4976 4985 4991 4998 5001 5003 5017 
5022 5030 5037 5040 5075-5076 5108 51 18 5143 5163 5171- 
5172 5313 5481 5503 5505 5521 5523 5526 5535 5549 5552 
5656 5691 5724 5726 5740 5750 5766 5772 5788 5794 5796 
5801 5865 5874 5894-5895 5906 5908 5923-5924 5928 5938 
5989 6027 6042 6057 6063-6064 6082 6094-6103 6125 6130- 
6137 6142 6147 6166-6167 6171 6181 6214 6226 6239 6253 
6301 6307 6371 6374 6418 6512 6662 6716 6730 6788 6792 
6799 6815 6836 6860 6873 6876-6877 6886-6887 6945 6975 
6979 7018 7037 7056 7058 7067 7116 7118 7137 7155 7158 
7171 7275 7315 7384-7385 7387-7389 7391-7393 7395-7398 
7452-7463 7468 7494 7608 7671 7676 7687 7714 7733 7792 
7815 7845 7864 7870 7905 7930 8093 8107 8110 8120 8139- 
8140 8224 8226 8262 8276 8320 8363-8365 8368 8375 8387 
8432 8452 8456 8520 8535-8536 8539 8562-8563 8577 8579 
8597 8599-8601 8603-8605 8608 8640-8646 8648 8664 8669 
8674 8677 8691 8702 8722 8753 8755 8798 8815-8816 8830 
8838-8839 8858 8863 8870 8876 8898 8943-8944 8948 8951 
9001 9010 9012 9061 9076 9092 9097 9128 9257 9304 9454- 
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9456 9466 9472-9475 9484 9497 951 1 9515 9517 9529 9536 
9542 9556 9567 9612 9626 9702-9703 9723 9731-9738 9745 
9752 9758 9762-9770 9780-9781 9789 9791 9808 9832 9842 
9927-9928 9939 9942 9970 10000 10175 10275 10319 10329- 
10331 10333 10345-10346 10412 10428 10430 10433 10435- 
10436 10453-10454 10456 10465 10470 10475 10488 10496 
10513 10566 10645 10679 10683 10691 10774 10776 10778 
10788 10791 10795 10802 10891 10928 10930 10935-10941 
10948 10961-1096210964-10967 11034 11055 11108 11132 
11136 11145 11239 11289 11405-11406 11459 11483 11519 
11588 11604 11606 11618 11620 11693 11695 11805 11809- 
11816 11818 11830 11833 11848 11850-1185611868 11927 
11932 11934-11935 11937-11939 11954 11988 12000 12006 
12009 12028 12033 12037 12039 12047 12053 12092 12097 
12118 12127 12143 12159-12160 12178 12186 12327 12336 
12363 12375 12387 12402 12404-12405 12410 12424 12438 
12442 12490 12511 12521-12522 12556 12560 12564 12570 
12590 12599 12601 12605 12615-12617 12636 12643 12653 
12660 12670 12672-12680 12693 12700-12701 12706-12707 
12715 12735 12754 12823 12895 12904 12910 12913 12916- 
12918 12923 12925 12935 12945 12947 12955-12956 12972 
12977 12993 13002 13020-13021 13051 13062 13066 13072- 
13073 13082 13085 13117 13126 13130-13131 13135 13138- 
13140 13147-13151 13161 13169 13189 13254 13295 13391 
13394 13425 13436 13442 13456 13477 13479 13488-13490 
13494-13496 13498-13503 13505-13507 13512 13530 13546 
13550-13551 13554 13557-1356413575-13581 13590 13592- 
13593 13609 13613 13616 13627 13630 13645-13646 13660 
13679 13713 13782 13803 13864 13866-1386913872 13883 
13885 13888 13893 


leukocyte 


Clontech 


LUC003 


536 539 541-543 593 728 1552 1927 1974 3089 3252-3253 
3434 3548 3709 371 1 3722 3797 401 1 4019 4040 4060 4194 
4201 4277 4282 4622 4707 4791 4841-4842 4949 4998 5001 
5030 5345 5895 6052 6138 6227 6419 6595 6754 6765 6767 
6788 6911-6912 7049 7139 7171 7464 7575-7577 8093 8110 
8116 8365 8370 8375 8592 8648 8830 8863 8944 9466 9544 
9612 9758 9771 9773 9790 9929 10326 10346 10679 10779 
10961 11035-11036 I1I32 11401 11513 11853 11940 12160 
12256 12405 12670 12674 12681 12840 12904 12906 13020 
13051 13162 13280 13400 13488 13511 13516 13554 13670 
13679 13860 13869 13883 13889 


melanoma 
from cell line 
ATCC #CRL 
1424 


Clontech 


MEL004 


83 142 360 447 841 1061 1346 1516 1573 1724 1959 1963 
2159 2175 2343 2367 2513 2652 2812 2876 2961 3001 3272 
3345 3604 3627 3722 3754 3839-3840 3985 3988 4040 4194 
4246 4341 4515 4568 4641 4667 4862 4940 5075 5132 5163 
5481 5492 5523 5740 5788 6008 61 12 61 19 6147 6189 6195 
6272 6287 6418 6423 6508 6563 6726 6860 6870 7134 7166 
7169 7275 7294 7579 7677 7748 783 1 7852 7980 81 10 81 16 
8226 8235 8262 8320 8345 8363 8370 8456 8563-8564 8633 
8838 9001-9002 9128 9296 9304 9313 9484 9542 9544 9646 
9703 9758 9780-9781 9808 9999 10007 10027 10296 10346 
10470 11145 11239 11289 11401 11406 11508 11596 11606 
11620 11823 11899 11950 12019 12107 1216012292 12329 
12363 12405 12436 12523 12599 12658 12670 12749 12754 
12774 12842 12930 13020 13296 13394 13458 13489 13498 
13501 13507 13551 13554 13575 13613 13616 13649 13660 
13743 13775 13868 


mammary 
gland 


Invitrogen 


MMG001 


51 67 142 449 594 598 616 708 729-733 1004 1060 1194-1196 
1310-1315 1329 1536-1541 1826 1848 1858 1914 1947 1974 
1976-1978 2174-2175 2283 2299 2310 2336 2343 2403-2406 
2449 2483 2555 2563 2576 2597 261 1 2614 2617 2812 2827 
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2926 3191 3207 3490 3503 3673 3714-3715 3722 3737 3777 
3796 3807-3819 3885 4003-4004 4060 4330 4346 4355 4391 
4399 4434 4543 4671 4767 4791 4836-4837 4843 4940 4964 
4975 4985 4998 5001 5017 5019 5022 5052 5108 5155 5163 
5168-5169 5269 5345-5347 5466 5526-5527 5529 5581 5615 
5724 5772 5778 5788 5794 5894-5895 5902 5907 591 1 5917 
5923-5924 6048 6057 6064 6094 6309 6125 6147 6156-6157 
6215 6228 6321 6430 6458 6496-6499 6545 6579 6598-6599 
6607 67 1 1 6765 6780 6793 6797 6799 6805 6880 693 1 6979 
6987 6996 7012-7013 7048 7054 7057 7104 7141 7158 7169 
7517 7578-7581 7583 7872 7937-7940 8110 8139 8156 8224 
8262 8292 8298 8336 8359 8363 8365 8372 8452 8619 8628 
8646 8648 8722 8758-8759 8782 8813 8815 8863 8882 8939 
8963 8983 9012 9097 91 1 1 9139-9141 9143 9157 9164 9264 
9313 9327 9347 9466 9484 951 1 9517 9544 9553 9563 9577 
9646 9700 9703 9734 9746 9780 9791 9844-9846 9927-9928 
9942 10006-10007 10051-10055 10144 10146-10147 10162 
10285 10330 10346 10439 10490-10491 10496 10542 10557 
10600 10628 10645 10691 10729 10772 10796 10983 11038 
11108 11132 11216-11217 11277-11279 11396-1139711406 
11459 11513 11596 11606-11607 1 1626 1 1628 1 1823 11830 
11833 11902 11941-11943 11979 12000 12009 12037 12047 
12078 12093 12146 12160 12181 12258-12263 12281 12327 
12371 12383 12387 12395 12424 12426 12438 12442 12484 
12511 12564 12570 12590 12594 12601 12605 12609 12615- 
12616 12618 12621 12653 12670 12691 12699 12702-12703 
12749 12771 12791-12794 12834-12835 12885 12898 12905 
12913 12916-12917 12920 12923 12955 12976-12977 13002 
13020 13035 13047 13062 13064 13066 13072-13073 13084 
13090 13092 13127 13135 13169 13190 13193 13195 13249 
13254 13296 13320-13322 13392 13456 13477 13479-13480 
13488-13489 13494 13497-13499 13505-13507 13512 13514 
13516 13546 13551 13554-13555 13590 13611-13612 13631 
13649 13670 13713 13743-13745 13838 13864 13866-13867 
13869 13872 13875 13882-13885 138S8 13891 


induced 
neuron cells 


Strategene 


NTD001 


88 1493 1552 1561 2034 2090 2510 2522 2570 2576 2623 3226 
3272 3839 4006 4058 4282 4384 4694 4783 4888 5103-5104 
5172 5523 5541-5542 5684 5766 5788 5794 5938 6082 6195 
6418 6449 6607 6960 6972 7058 7494 7815 7937 8370 8929- 
8930 8996 9000 9003 9128 9157 9289-9291 9472 9928 10275 
10550 10729 10760 11607 11736 11818 12039 12075 12094 
12256 12363 12381 12423 12522 12707 12735 12863 12906 
12958-12959 12963 13020 13052 13054 13196 13202 13310 
13368 13378 13438 13479 13486 13506-13507 13546 13616 
13654 13659 13691 13795 13891 


retinoid acid 
induced 
neuronal cells 


Strategene 


NTR001 


1552 3839 3885 4282 4434 5030 9308 9466 11108 11132 
12034 12039 12405 12424 12570 12905 13047 


neuronal cells 


Strategene 


NTU001 


88 708 1552 1561 1947 2223 2490 2614 3272 3610 3625 3627 
4032 4201 4253 4297 4337 4434 4543 4622 4641 5104 5163 
5523 5541 5684 5788 6094 6147 6307 6601 6862 7755 8648 
8740 8996 9097 9128 9289 9308 9466 951 1 9544 10333 10779 
10963 11032 11108 11132 12034 12039 12075 12405 12424 
12570 12590 12706 12749 12830 12905 12917 12963 13035 
13073 13280 13490 13507 13613 13616 


pituitary 
gland 


Clontech 


PIT004 


1 148 5674 6068 6371 7759 8093 10430 12605 12636 12658 
12916 13021 13082 13875 


placenta 1 


Clontech 


PLA003 


3885 4025 5030 5536 5711 5923 8349 8367 9921 12033 12076 
12403 12511 13655 13885 


prostate 


Clontech 


PRT001 


67 83 142 225 235 379 486 572 616 1286 1316-1329 1538 
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2023 2139 2147 2161 2266 2290 2337-2344 2346 2367 2422 
2444 251 1 2631 2809 3001-3002 3084 3419 3563 3625 3627 
3797 3810 3821-3827 3829-3835 4019 4022 4173 4282 4332 
4434 4667 4755 4791 4862 4865 4998 5001 5066 5171 5349 
5352-5356 5358-5361 5363-5364 5481 5581 5656 5674 5723 
5740 5902 6027 6047 6125 6321 6371 6374 6427 6458 6463 
6500-6507 6509 6563 6598 6643 6793 6803 6871 7014-7015 
7058 7104 7275 7320 7608 7635 7733 7842 7852 7864 7941- 
7943 7946-7951 7953-7954 8077 8087 8093 8110 8224 8226 
8452 8487 8520 8558 8635 8677 8863 8950 8963 8983 8997 
9012 9145-9148 9150-9154 9269 9302 9317 9466 9503 9528 
9646 9703 9780 10000 10027 10056-10060 10491 10629 10691 
10777 10891 11145 11200 11239 11281 11283-11295 11344 
11406 11761 11837 11862 12006 12166 12264-12278 12305 
12363 12368 12411-12412 12438 12467 12685 12691 12729 
12734 12795-12798 12800 12830 12863 12904 13010-13011 
13104 13146 13295 13323-13326 13377 13394 13456 13473 
13477 13489 13516 13533 13550 13611 13678-13679 13746 
13866-13867 13884 13889 


rectum 


Invitrogen 


REC001 


6 67 142 683 731 997 1 178 1909 1959 2005 2023 2596 261 1 
2614 2809 2926 3314 3333 3455 3722 3780 3 870 4173 4355 
4837 4949 4987 5526 5580 5615 5674 5691 5724 5788 5895 
5909 5924 6057 6112 6195 6765 6805 7815 7833 8095 8664 
8863 9517 9539 9544 9884 9927 10400 10666 11401 11513 
11606 11985 12160 12327 12428 12693 12848 12910 12977 
13051 13064 13072 13420 13494 13497 13507 13512 13515- 
13516 13866 13869 13884 13888 


salivary gland 


Clontech 


SAL001 


67 73 1 800 997 1054 1914 2267 2395 2529 3 136 3595 3627 
4011 4192 4246 4330 4434 4641 4957 4987 5040 5052 5163 
5451 5481 5706 5723 5788 5895 6219 6621 6801 6900 6975 
7045 7733 8110 8372 8535 8563 8635 8830 8951 9000 9010 
9051 93 13 9472 9475 9671 9724 9758 9927 10027 1 1 145 
11695 11725 12017 12284 12363 12424 12427 12570 12609 
12670 12674 12693 12977 13035 13307 13554 13617 13867 
13872 13889 13891 


salivary gland 


Clontech 


SALs03 


1516 1724 1858 5030 6186 13657 13864 


skin fibroblast 


ATCC 


SFB001 


2251-2252 5788 6068 12511 


skin fibroblast 


ATCC 


SFB002 


6068 8951 12511 


skin fibroblast 


ATCC 


SFB003 


4025 5895 7741 


small 
intestine 


Clontech 


SIN001 


142 319 627 654 1034 1063 1197-1198 1330-1338 1340-1359 
1575 1646 1774 1814 19782161 2347-2354 2409 2876 3046 
3419 3460 3605 3716-3718 3737 3797 3837-3839 3841-3843 
3845-3857 3885 3986 4060 4201 4301 4351 4385 4568 4689 
4694 5076 5163 5270-5273 5304 5326 5365 5367-5372 5374 
5503 5550 5701 5772 6064 6094 6171 6288 6427 6430-6432 
6438 6510-6522 6598 6615 6793 6815 6997-6998 7016-7018 
7054 7058 7072 7309 7450 7604 7769 781 1 7873-7876 7955 
7957 7959-7962 7964 8120 8298 8350 8452 883 0 8863 8950- 
8951 8966 9010 9073-9075 9119 9126 9128 9155-9166 9303 
9544 9560 9780 9884 9928-9929 10008-10010 10061-10068 
10097 10262 10330 10351 10601 10630-1063410760 10983 
11061 11219 11296-11308 11310-11313 11513 11620 11693 
12182-12183 12280-12287 12327 12363 12488 12707 12799- 
12801 12922 12991 13012-13014 13035 13051 13064 13297 
13307 13328-13332 13335 13382 13499 13506 13554 13560 
13575 13631 13695 13714 13747-13749 13751 13882 13884 


skeletal 
muscle 


Clontech 


SKM001 


1104 1346 2363 2367 2495 2555 2876 2880 3555 3634 3722 
4011 4022 4194 4201 4253 4277 4282 4434 4641 4940 4972 
4998 5343 5481 5523 5801 6005 6336 6873 7408 7995 81 10 
8120 8235 8262 8292 8345 8372 8576 8740 8830 8936 8951 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








9303 9689 10616 10679 11132 11145 11825 11869 11927 
12387 12442 12467 12570 12663 12735 12916 13280 13503 
13872 13885 ! 


skeletal 
muscle 


Clontech 


SKM002 


8535 


skeletal 
muscle 


Clontech 


SKMsOS 


6336 6962 8936 


skeletal 
muscle 


Clontech 


SKMs04 


770 1724 3797 4277 6336 12405 13658 


spinal cord 


Clontech 


SPC001 


83 142 390 415 598 668 708 731 1184 1199-1207 1360-1375 
1377-1396 1516 1574-1576 1595-1596 1849 1927 2070 2129 
2161 2311-2314 2345 2355-2368 2423-2424 2430 2484 2529 
2569 2576 2876 3215 3249 3272 3283 3532 3584 3627 3634 
371 1 3719-3722 3737 3839 3860-3884 401 1 4025 4038-4039 
4043 4055-4056 4173 4246 4282 4354 4375 4391 4434 4681 
4767 4781 4808 4964 4985 4998 5037 5163 5233 5274-5277 
5375-5392 5394-5402 5523 5569-5570 5581 5615 5723 5788 
5835 5902 5928 5936 6047 6078 6082 621 1 6288 6374 6433- 
6435 6512 6523-6531 6534 6595 6616 6625-6626 6788 6894 
6979 6999 7018-7026 7126 7166 7359 7473 7642 7653 7807 
7814 7877-7879 7965-7968 7970 7972-7980 8105 8108-8110 
8139 8246 8298 8345 8363 8368 8482 8603 8646 8884 8898 
8981 9010 9012 9076-9078 9098 9167-9179 9184-9189 9264 
9302 9304 9319-9320 9455 9466 9520 9530 9544 9556 9567 
9781 9895 9901 9928 9942 9947 9969 9999 10007 10069- 
10077 10079-10085 10177 10296 10326 10346 10376 10422 
10566 10602 10635-10638 10679 10685-10686 10729 10776 
11132 11220 11246 11314-11323 11325-11330 11417-11418 
11459 11513 11818 12000 12011 12017 12033 12039 12160 
12184-12185 12288-1229212295-12299 12301-12305 12363 

19^7^ lOIRt 19"3R7 1 9zt09 19zl13 1 9/Ld9 19zL££ 19^97 19^0^ 

12617 12636 12657-12658 12739-12740 12754 12772 12802- 
12809 12830 12835 1284M2842 12905 12923 12940 12976 
13003 13015 13017-13021 13051-13052 13117 13126 13136 
13960 11977 1T25H 1^90^ 1 ^^^6-1 IIAI 1^^67 1^449 1 1A^A 
13473 13477 13481 13495 13497 13499-13500 13507 13516 
13659 13670 13713 13715-13716 13748 13752-13759 13803 
13869 13872 13884-13885 13888 13893 13896 


adult spleen 


Clontech 


SPLcOl 


800 1927 4032 4834 6064 6135 6195 6446 6788 6873 7166 
7455 8966 9929 10744 12402 12564 12590 12691 12904 12933 
13082 13500 13506-13507 13516 13575 13864 13869 13883 
13889 


stomach 


Clontech 


STO001 


21 83 142 1004 1208-1215 1217-1219 1397 1399-1405 1671 
2315-2316 2345 2369-2373 2375 2575-2576 2809 2846 2984 
3136 3166 3537 3610 3698 3723-3725 3839 3885-3897 4057- 
4059 4173 4277 4410 4480 4667 4791 4808 4940 4987 5262 
5278-5281 5283-5284 5403-5405 5407-5424 5481 5656 561 A 
5796 5904 6418 6436-6440 6535-6540 6563 6627-6629 6765 
6940 7000-7001 7027-7030 7064 7135 7509 7604 7880-7885 

7Q81-7QQ0 8087 81 10 8190 814^ 8996 84^9 R^- 5 ! OOIO 007Q 

9081 9191 9193-9196 9304-9306 9313 9317 9321 9715 10007 
10011-10013 10086-10093 10178-10179 10603-10605 10640- 
10642 11069 11167 1 1221-1 1222 1 1331-1 1337 1 1339-1 1343 
11419 11513 11818 12186-12190 12307-12314 12327 12363 
12425-12427 12438 12617 12773-12774 12810-12811 12834 
13082 13103 13298-13299 13344-13349 13592 13630 13670 
13717 13760-13764 13782 13888 


thalamus 


Clontech 


THA002 


579 598 616 1065 1148 1220-1221 1223-1226 1407-1432 1597 
2266 2317-2319 2340 2342 2376-2378 2380 2431 2444 2555 
3093 3230 3286 3537 3722 3726-3732 3737 3898-3902 3904- 



126 



WO 01/64835 



PCT7US0 1/04927 



Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








3918 3920-3922 4060-4062 4173 4201 4282 4360 4391 5270 
5285-5288 5425-5438 5440-5449 5701 5902 5938 6137 6437 
6458 6461 6541-6544 6546-6550 6630 6755 7031 7095 71 IS} 
7166 7484 7579 7815 7849 7905 7977 7991-8000 8002-8005 
8126-8128 8134 8363 8558 8664 8786 8890 8930 8963 9082- 
9085 9128 9197-9199 9201-9216 9251 9264 9308 9503 9515 
9556 9646 9703 9928 10014-10017 10027 10094-10100 10102 
10767 10496 10606-10608 1O64^-1O6S0 1 0696 10891 11 14S 
11223-11225 11344-11355 11406-11407 11420 11513 11604 
11695 11791 11950 12022 12107 12191-1219912315-12331 
12363 12375 12405 12428-12430 12688 12706 12735 12748 
12754 12812-12813 12815 12835 12914 12959 13020 13023- 
13025 13060 13073 13300 13351-13358 13402-13403 13489 
13496 13506 13512 13719-13720 13765-13768 13770 13872 
13883 


thymus 


Clontech 


THM001 


51 142 150 332 346 360 438 546 731 760 895 1004 1104 1227- 
1234 1264 1391 1516 1577-1585 1812 1860 1877 2129 2174 
2215 2263 2321-2322 2408 2414 2425 2449 2490 2555 2569 
2575 261 1 2847 2880 3435 3530 3722 3727 3733 3735-3742 
3839 3870 4006 4041 4043-4046 4060 4109 4375 4396 4399 
4434 4667 4671 4759 4791 4885 4976 4987 5108 5289-5290 
5466 5481 5526 5553-5554 5796 5956 5979 6020 6186 6253 
6336 6371 6411 6438 6441-6447 6617-6621 6765 6788 6797 
6870 6886 6908 6972 6994 7059-7060 7126 7141 7166 7168 
7310 7383 7450 7494 7632 7716 7779 7887 7889-7891 8087 
8111-8114 81 17-8120 8139 8226 8343 8368 8370 8452 8456 
8633 8830 8898 8921 8963 9010 9076 9086-9087 9128 9134 
9264 9304 9307-9313 9401 9466 9497 9544 9563 9582 9612 
9626 9646 9758 9781 9927 9937 9969 10018 10065 10163- 
10166 10175 10284 10329 10414 10557 10609-10610 10617 
10679 10687-10691 10742 10760 10772 10776 10778 10881 
10891 11187 11194 11226-11229 11274 11406 11408-11410 
11419 1 14S0 1 1 SI ^ 11 676 1 1 60S 1 1 S^O 1 1 £6S 1 1Q49 1 7000 

12006 12016 12022 12092 12160 12186 12200-12204 12327 
12363 12414-12417 12427 12462 12470 12490 12564 12693 
12706 12735 12740 12835 12840 12898 12905 12910 12913- 
12914 12916 12977 13020 13036 13051 13062 13111 13126 
13141 13295 13301-13305 13326 13351 13391 13396-13397 
13456 13484 13498 13505 13507 13512 13516 13546 13713 
13803 13805 13808 13866 13869 13885 13888 


thymus 


Clontech 


THMc02 


16 27 51 67 142 390 598 1233 1493 1508 1586-1588 1598-1603 
1724 1841 1918 1927 1959 1976 1979 1989 2057 2107 2161 
2223 2290 2407 2426-2428 2432-2433 2444 2487 2569 2617 
3537 3664 3711 3720 3722 3737 3780 3817 4021 4025 4047- 
4049 4060 4063-4068 4246 4277 4377 4405 4688 4694 4759 
4791 4837 4949 4957 4992 4998 5037 5052 5108 5505 5553 
5555-5564 5571-5572 5683 5766 5772 5788 5796 5894 591 1 
6048 6186 6225 6288 6306 6333 6621 6728 6730 6765 6767 
6780 6788 6815 6867 6873 6979 7049 7061-7062 7166 7169 
7676 7687 7809 8084 8121 8129-8130 8179 8262 8354 8363 
8365 8375 8482 8597 8654 8740 8786 8791 8963 9076 9157 
9257 9264 9314 9323-9324 9401 9454 9466 9519 9529 9536 
9637 9700 9703 9927-9929 9940 10004 10007 10070 10167 
10169-10172 10180 10182 10^67 10284 10326 10331 10383 
10439 10452 10542 10605 10691 10693-1069410697 10744 
10775 10778 10795 11046 11098 11274 11413-11415 11462 
11519 11576 11592 11606 11618 11621 11627-11628 11693 
11807 11814 12022 12034 12044 12080-12081 12086 12160 
12236 12256 12327 12335 12363 12368 12387 12418-12419 
12424 12433 12556 12560 12564 12570 12577 12594 12599 
12612 12663 12735 12754 12836-12839 12844 12905 12913 
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Jl 1J>3 uc 

Origin 


r>xr a 
Source 


ijlul al y 

Name 










12955 12977 13020 13035 13037 13051 13062 13072 13087 
13193 13280 13295 13398 13404 13456 13488 13499-13500 
13503 13507 13516 13533 13535 13546 13611 13613 13630 
13646 13649 13659 13670 13678-13679 13713 13769 13775 
13803 13806-13808 13810-13811 13869 13872 13882-13883 
13885 13888-13889 


thyroid gland 


Clontech 


THR001 


49 115 142 360 641 698 800 1004 1134 1193 1233 1235 1237- 
1241 1329 1433-1471 1473-1476 1478-1479 1481 1589-1591 
1593-1594 1604 1606-1612 1614-1615 1639 1671 1906 1927 
1959 1976 1994 2090 2099 2134 2150 2161 2174 2186 2215 
2343 2367 2381-2387 2429 2434-2437 2449 2490 2510 2529 
2555 2569 2662 2827 2901 2961 2997 3001 3 109 3265 3286 
3425 3490 3503 3560 3643 3698 3722 3737 3743-3747 3797 
3839 3885 3923-3941 3943-3954 3956-3961 4009 4050-4053 
4060 4069-4082 4109 4173 4194 4253 4277 4282 4348 4354- 
4355 4376 4391 4396 4412 4434 4641 4689 4694 4755 4759 
4783 4791 4834 4957 4972 4985 4987 5018 5037 5052 5075- 
5076 5108 5147 5163 5171 5270 5292-5294 5317 5380 5450- 
5461 5464-5481 5503 5523 5565-5568 5573-5574 5693 571 1 
5724 5757 5772 5788 5894-5895 5917 5923 5938 5959 5966 
6005 6013 6027 6047-6048 6061 6064 6125 6135 6139 6189 
6215 6240 6336 6371 6374 6448-6449 6458 6508 6538 6551- 
6554 6556-6561 6563-6572 6595 6598 6607 6622-6623 6631- 
6636 6793 6803 6815 6873 6899 6955 6958 7032-7038 7040 
7043 7063 7066-7067 7079 71 16 7155 7233 7275 7455 7669 
7743 7792 7839 7845 7857 7892-7893 7910 7930 7989 8006- 
8008 8010-8033 8053 8067 8087 8092-8093 81 10 8116 8120 
8122-8124 813 1-8143 8196 8226 8235 8262 8345 8365 8368 
8370 8372 8375 8452 8520 8535 8543 8597 8646 87 1 1 8760 
8816 8839 8863 8898 8930 8948 8951 8963 8966 9010 9055 
9088-9091 9142 9157 9217-9220 9222-9227 9229-9242 9244 . 
9257 9264 9304 9315-9316 9325-9330 9368 9401 9455 9466 
9484 9525 9542 9544 9563 9646 9695 9703 9780-9781 9S55 
9928 9942 9947 10000 10019 10027 10103-10117 10119-10120 
10173-10175 10183-10185 10346 10376 10440 10470 10496 
10611 10651-10657 10666 10679 10695 10698-10704 10760 
10772 10778 10782 10788 10891 11033 11051-11052 11061 
11063 11112 11231 11256 11356-11357 11359-11360 11362- 
11367 11369-11372 11376 11385 11406 11416 11421-11425 
11462 11513 11588 11605-11607 11620 11833 11869 11932 
11980 11985 12006 12017 12033 12143 12160 12193 12195 
12205 12207-12208 12292 12332-12336 12338-12345 12347 
12349-12356 12358-12360 12363 12368 12405 12420 12422 
12424 12428 12434-12439 12465 12523 12601 12605 12609- 
12611 12617 12636 12645 12670 12691 12707 12740 12749 
12754 12768 12775 12777 12817-12824 12830 12840 12845- 
12848 12863 12911 12940 12955 12977 13020 13026-13028 

1 ^H^S 1 1 QfK1 1 *30£9 1 ^fWO 1 LCl'l'X 1 30QO 1 I A/1 1Q 1 1 7 

13126 13195 13202 13254 13295-13296 13360-13371 13389 
13391 13399 13412 13479 13488-13489 13492 13494 13498 
13500 13503-13504 13506-13507 13511-13512 13516 13533 
13551 13554 13590 13613 13627 13644 13646 13656 13659- 
13660 13670 13713 13721 13743 13771-1377413782 13803 
13809 13812-13814 13860 13866-13867 13869 13872 13882 
13884-13885 13888 13891 13893 


trachea 


Clontech 


TRC001 


83 438 483 858 1006 1034 1242-1245 1359 1483-1494 1616- 
1619 1621 1671 2266 2324 2388-2391 2409 2438-24392444 
2575 3001 3136 3272 3425 3460 3535 3548 3748-3756 3810 
3962-3967 4083-4086 4530 4755 4758 4949 4957 4987 5075 
5213 5295-5299 5481-5482 5484 5486-5488 5490-5491 5535 
5576-5580 5656 5941 6091 6450-6453 6574-6578 6637 6797 
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Tissue 
Origin 


RNA 
Source 


Library 
Name 


SEQ ID NOS: 








6955 7041-7042 7049 7068 7204 7347 7733 7864 7894 8034- 
8043 8045 8120 8139 8144 8563 8635 8646 9012 9092-9099 
9245-9251 9317 9331 9333 9484 9503 9517 9734 10020-10022 
10027 10097 10121-10123 10125-10127 10284 10344 10478 
10612 10658-1065910705 10812 10948 1 1 145 1 1233-11239 
11373-11376 11428-11429 11942 12022 12209-12211 12292 
12361-12363 12365-12366 12438 12440-12441 12611 12617 
12734-12735 12825-12826 12900 12904-12905 12916 12923 
12965 12972 12976 12989 13029-13030 13306-13308 13372- 
13373 13492 13670 13713 13722 13775-13781 13803 13884 


uterus 


Clontech 


UTR001 


1036 1134 1246-1250 1324 1493 1495-1511 1622 1671 1928 
2145 2266 2310 2326 2343 2367 2392-2397 2555 3281 3479 
3720 3757-3760 3762 3969-3986 4088-4090 4192 4201 4246 
4277 4390-4391 4434 4515 4998 5002 5184 5300-5301 5481 
5492-5504 5656 5695 5712 5794 5936 6116 6475 6579-6587 
6595 6870 7002 7043-7044 73 1 1 7484 7895-7899 8046-8055 
8057-8059 8061 8110 8146-8147 8152 8226 8262 8622 8677 
9008 9092 9137 9252-9257 9334 9370 9414 9466 9539 9703 
10023-10024 10128-10134 10346 10595 10660-1066810706- 
10708 11145 11239 11289 11377 11379-1 1385 1 1695 11830 
12367-12369 12371-12374 12442-12443 12570 12670 12693 
12827-12829 12914 13002 13047 13073 13083-13084 13131 
13260 13277 13309-13310 13374-13376 13394 13489 13512 
13713 13782 13784-13787 13866 13869 
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TABLE 2 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Id en tit 

y 


i 


iVjL) CO Ul 


— ; 

Homo sapiens 


i — T o 1 — 

alpna-2-rnacroglobuiin 


lift 


oy 


2 


AF1 18090 


Homo sapiens 


PRO2044 


247 


59 


J 


VA1 £.01 

AO Lo&J 


Homo sapiens 


alpha 1 -antitrypsin 




no 
/o 


4 




Homo sapiens 


reverse transcriptase 




1*7 


5 


M22332 


Homo sapiens 


unknown protein 


89 


40 


6 


A T? A 1 CCO A 


Mytilus edulis 


precollagen P 


1 in 

1 1j> 


33 


7 


A0j325 


Homo sapiens 


apolipoprotein B fragment 


C A A 

j4U 




8 


AB019280 


Mus musculus 


sprouty-4 


91 


35 




D88 152 


Homo sapiens 


acetyl-coenzyme A transporter 


625 


87 


10 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


124 


58 


11 


AL049569 


Homo sapiens 


dJ37C10.5 (KIAA0445) 


182 


82 


12 


AJ242540 


Vol vox carter i f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


85 


37 


13 


L27428 


Homo sapiens 


reverse transcriptase 


135 


61 


14 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


266 


72 


15 


U93569 


Homo sapiens 


putative pi 50 


135 


37 


16 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


132 


67 


17 


X53581 


Rattus 
norvegicus 


ORF4 


124 


34 


18 


AF1 83961 


Homo sapiens 


carbon catabolite repression 4 protein 
homolog 


431 


75 


19 


AJ002190 


Homo sapiens 


dihydroxyacetone phosphate 
acyltransferase 


551 


88 


20 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotem 


127 


45 


21 


AK001269 


Homo sapiens 


unnamed protein product 


1643 


99 


22 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


275 


58 


23 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


123 


75 


24 


AF1 56550 


Mus musculus 


putative E1-E2 ATPase 


168 


58 


25 


AF 119856 


Homo sapiens 


PR01851 


585 


83 


26 


U49974 


Homo sapiens 


mariner transposase 


187 


46 


27 


G00901 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4982. 


86 


30 


28 


AF29577^> 


Homo sapiens 


ral guanine nucleotide dissociation 
stimulator 


126 


74 


29 


AF1 13685 


Homo sapiens 


PRO0974 


92 


73 


30 


U83303 


Homo sapiens 


line- 1 reverse transcriptase 


102 


50 


31 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


279 


75 


JZ 


Ar00:>535 


Homo sapiens 


ORF2-like protein 


114 


An 

47 


j3 


MI5386 


Homo sapiens 


gamma-globin 


:>7u 


OA 

84 


j»4 


M19419 


Mus musculus 


proline-rich salivary protein 


1 1 A 

110 


1 c 

35 


JO 


A "CO 1 1 A/1 ^ 

Ar 21 1943 


Homo sapiens 


WW domain-containing protein 

WHY 


c o c 

586 


83 


36 


X13S85 


Nicotiana 
tabacum 


extensin(AA 1-620) 


103 


35 


37 


U93563 


Homo sapiens 


putative pi 50 


127 


58 


38 


U93564 


Homo sapiens 


putative pi 50 


103 


77 


39 


AF069732 


Homo sapiens 


ADA2-like protein 


524 


88 


40 


X61046 


Hydra sp. 


mini-collagen 


101 


34 


41 


AK000322 


Homo sapiens 


unnamed protein product 


566 


80 


42 


G03646 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


57 
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SEQID 
NO: 


Accession No, 


Species 


Description 


Smith- 
Waterman 


% 

Identit 

y 








7727. 






43 


AF071081 


IvTvcobacteriurn 
tuberculosis 


oro line-rich mucin hornolos? 


104 


41 


44 


AF092135 


Homo sapiens 


PTD014 


228 


41 


45 


Y73353 


Homo sapiens 


HTRM clone 1870914 protein 
sequence. 


295 


56 


46 


AF1 18082 


Homo sapiens 


PRO 1902 


119 


44 


47 


X78926 


Homo sapiens 


zinc finger protein 


442 


52 


48 


X54326 


Homo sapiens 


glutaminyl-tRNA synthetase 


542 


81 


49 


D50645 


Homo sapiens 


SDF2 


321 


95 


50 


M92439 


Homo sapiens 


leucine-rich protein 


344 


80 


51 


U28963 


Homo sapiens 


Gps2 


593 


82 


52 


U41806 


Homo sapiens 


p60 


660 


81 


53 


AF181490 


Homo sapiens 


prenylcysteine lyase 


461 


78 


54 


U93570 


Homo saoiens 


tyutative ol 50 


147 


36 


55 


W73499 


Homo sar>ipris 


Von WillphranH factor 


529 


76 


56 


AF1 19851 


Homo sapiens 


PRO 1722 


126 


57 


57 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


115 


61 


58 


AT 0219^59 




dT352A?0 1 faldehvde nehvdroo-enase 
family protein) 


422 




59 


L24158 


Homo sapiens 


integrin alpha 9 protein 


117 


71 


60 


Y32157 


Homo sapiens 


Human SH3 D 1 A protein . 


530 


91 


61 


X61296 


Rattus 
norvegicus 


open reading frame 2 


117 


31 


62 


AK002064 


Homo sapiens 


unnamed protein product 


330 


80 


63 


ABO 122^3 


Oanis familiaris 


ORF2 


80 


56 


64 


U93570 


Homo sapiens 


putative pi 50 


113 


37 




TTl ^647 




ICVCloC Li ailbUl ljJlu.S)C 


1 s? 






00^41 1 




T-Iiim?»r» cPTTP'tprl T^vritpin QT- (~} Tl"^ TvIO* 
OUlilcUJ bCl/lCt-CU JpiLHGJUlj k5J_/V^ LIS i.NV-J. 

7492. 






67 


Y48359 


T-Tnmo saniens 


Human nrostate cancer-associated 
protein 56. 


590 


99 


68 


W74879 


Homo sapiens 


Human secreted protein encoded by 
gene 151 clone HTLEF62. 


368 


98 


69 


AF 104921 


Homo sapiens 


succinyl-CoA synthetase alpha subunit 


604 


93 


70 


AF 175265 


Homo saniens 


vacuolar sortinfr orotein 35 


632 


88 


71 


U93571 


Homo saoiens 


p40 


106 


33 


72 


XI 5324 


Homo saoiens 


an ffi oten si n o & en 


330 


84 


73 


Z98204 


Hordeum 
vulgare 


extensin 


111 


38 


74 


Y30713 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


232 


61 


75 


AF 11 8092 


Homo sapiens 


PRO2061 


453 


79 


76 


M63175 


Homo sapiens 


autocrine motility factor receptor 


190 


85 


77 


M26361 


Mus musculus 


LINE/Ig H-chain fusion protein 


153 


38 


78 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NOt 
6613. 


171 


58 


79 


X78926 


Homo sapiens 


zinc finger protein 


199 


37 


80 


M77381 


Homo sapiens 


acrosin 


98 


54 


81 


G02538 


T Trim rt earner, s 


Human <?ppre»tpH nrntpin £11? O TD MO" 

6619. 




44 


82 


U93569 


Homo sapiens 


putative pi 50 


94 


38 


83 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 


100 


84 


AF255446 


Crypthecodinium 
cohnii 


Dip 1 -associated protein C 


129 


34 


85 


R59837 


Homo sapiens 


Sequence of human microtubule- 


82 


48 



131 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 












y 








PSsnfMfttpH rvrntpin tan 

CIl3jWL*UL LV^ LX L/XxJctilXX LU.H. 






86 


Y02749 


T-Trvmrv lanipn^ 

J. X.\JHX\J l>u[J l^llo 


T-Tnman ^pfrpfprl r»iwf"Pin pncnfipH V>v 

XXUXXXCt.il ObL/l V^Lt-VJ L»l V/LwXXX V.lIvUUvVJ. J 


157 


if\ 

i \j 








gene 100 clone HNFIU96. 






87 


AF1 16712 

I 1.1 X X vr / x £-* 


Homo ^aniens 


PRCP738 


91 


58 


88 


G02485 


Homo sapiens 


Human secreted protein, SEQ IE) NO; 


77 


44 








6566. 






89 


Y82742 


Homo saniens 


DNA renlication and renair associated 


315 


79 








protein (DRASP). 






90 


M16961 


Homo saoiens 


al Dh a-2 -HS» cr 1 vconrote in 


138 


74 


91 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


250 


56 








similar to pogo element 






92 


AF220656 


Homo saniens 


anontosis-assoeiated unclear nrotein 


62 


69 








PHLDA1 






93 


U65928 


Homo saniens 


Tun activation domain hindinE? nrotein 

JUll ClvIX V CllXVXX UAJIXXCHXX L/ 11 JUHJj_. L/X l/LVslXX 


188 


75 


94 


U93568 


Homo saniens 


nutative nl SO 


102 


48 


95 


S80119 


I? attus sn 


reverse transcrintase homolocr 


130 


53 


96 


U93563 


Homo saniens 


nutative n ! 50 


242 


50 


97 


Y86472 


XXVJlllv &CtJL/iv7I10 


HiiTnan ophp N^-pncnrlprl rvrrifpiri 

XXIXlild.ll gCllv V/11UUV1C\X kJlV/LVvULl 


105 


46 








fragment 5vFO TP) N0 387 






OR 


001714 

\~l\JJ l 1 *T 


W /">m ri ca r>ipn q 

X XCiilC dajJICriXk} 


IJiimfln cprrptpH nrntpin *sPO TD T\FO* 

XXtXiXiClii iCLiClCU ^JKJlfZLHy iJijy iJL/ ZNV-/. 


X -J 


67 








7795, 








^740 AS 


J3 v.] 11 lib ^alJallUi* 


yi Cpi VaiUlllilill 




o_> 


1VJU 


AF1 1 ROQO 

/TLX I 1 OU!7L» 


XlUixxU adpiCIlO 


P"RO90d4 
x rvvzuT-T 1 


Z,U7 


QO 


101 




XxUiiHJ bapiciii) 


v7l\r J j IVlXjXXJ? / j pUldLlVC Ll cxllbjJUbaaC 


1 OR 

170 


SI 








Similar tr> nncrn plpmpnt 

OXXllllCXX UV/ J^UgU V/l^XJlVvlXL 






102 


noun 


X X *J 1 1 1 U 30.JJlC.lio 


T-Tiiman QPfrptpH nrntpin ^T-«0 TO T\TO* 

XXlXlllClXl oLUCLCU LXlULClli, OJ_iV^ X 1 — ' LNV7. 


88 


53 








7214. 






103 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 


225 


82 








fragment, SEQ ID NO:490. 






104 


AF003535 


Homo sapiens 


ORF2-like protein 


114 


47 


105 


AF 13 0079 


Homo sapiens 


PR02852 


133 


56 


106 


AF 13 0089 


Homo sapiens 


PRO2550 


107 


71 


107 


M63473 


Homo saniens 


alnha-5 tvne TV oollacren 

CXXLfXXCX — f \,y LJ\j X V V»WXJCXg^lX 


131 


45 


108 


AF1 16661 

lil X X Ul/U X 


Homo saniens 


PRO 143 8 


112 


54 


109 


X92485 


PI Rsm odium 

X XU.lJ11XWVXXU.1XX 


pval 


101 


41 






vivax 








110 


AF010144 


Homo sapiens 


neuronal thread protein AE)7c-l v ITP 


113 


80 


111 


U49973 


Homo saniens 


ORF1 * MER37" nutative transoosase 


199 


69 








similar to pogo element 






112 


AF194537 


Homo sapiens 


NAG 13 


104 


44 


113 


L27428 


Homo sapiens 


reverse transcriptase 


160 


34 


114 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


137 


56 


115 


U49973 


Homo saniens 


ORFl, lvlER37j putative transposase 


191 


67 








similar to pogo element 






116 


AF 13 0052 


Homo sapiens 


PRO0956 


163 


47 


117 


t 97428 


Homo saniens 


revpr^e tranQr.rinta^e 

X \J V wi Jv LI CllXoL/X XL/LClOv 


117 


36 


118 


IJ93569 


XAvlllVJ DclJJlVllO 


mitativp r>1 ^0 

JJLXLCILIVC J-/ 1 \J 


104 


66 


119 


WtOJJ 1 


T-ToTTin «jfiT"\i pne 

XX\Jlxxk» acltJxGlIa 


T-Tiimpn ht*P5ict puti r*pr TplatpH rirotpin 
XXLillldll LrlveLoL L>u.llL>C>x 1 C1CXLCU. J^/i LILCXLl 












BCRB2 






120 


G00357 


Homo sapiens 


Human secreted nrotein SHO TO NO' 

X.XUXXXC4-XX JUvAwlvU L/X \J Iwill) k-7J— /V^ !!✓ 1 1 V» 


81 


57 








4438. 






191 


\J\J Jt J o 


T-Tnmr. campnc 


xiuiiiaii bcureieu pxULciii^ oJCrVi ij-' inw. 


/ o 


^1 
j i 








7519. 






122 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


95 


80 








6613. 






123 


X61296 


Rattus 


open reading frame 2 


94 


36 






norvegicus 








124 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


131 


60 








7882. 







132 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


125 


AF1 19900 


Homo ^aniens 


PR02829 


168 


uo 


126 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


209 


58 


127 




X 1<JLLL\J oCtj^JLCllo 


ICVCiSC CI cLLLi}{,rLpia$C 


1 07 
i yjjL. 


3s 
j> j 


128 


G02607 


£Xt_FllllS «3Q.j_/I&IID 


Human Ci»rT^»t*=»H nrnt^in QFD ID TSJO* 

JLXUIIICUI aCL/1 CICU pi ULC1JJ., l>JUV</ 11^ 1NV_/. 

6688. 


90 


uj 


129 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


113 


73 


130 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


170 


36 


131 


U93572 


Homo sapiens 


putative pi 50 


168 


38 




TT17961 


T— T rii-vi /~\ cambric 
JTHJIIIU adpiCHS 


X^JXjTiJ-J -^lllU linger pJULClIi 5 IVICLIIUU- 

conceptual translation supplied by 

CXLLIUKJI 




^7 
3 / 


133 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

44 84 


137 


92 


134 


B08918 


Homo sapiens 


Human secreted protein sequence 
cncoticu. oy gene zo oxz/v^ liv inla / j. 


58 


61 






XiOIIlvJ i>ctpiCIIb 


T-TiirY>QTi cpfrp+pH fkt*r»+^i'n i-< (^\ i f^i ~Mtf"l» 
XTUIIldll bCUClcU prULClll, OCy lJL-» INvJ. 

7870. 




DO 


116 


n/)l 1 77 

UvJ 1 / Z 


rxuiiio sdpicus 


xiuiTidii secreieci proiein, oc^ nu. 
7253. 




JO 


137 


T I4QQ71 


XlUIIiU ad.JJJ.CIlc> 


WJCvT 1 ? J.V1 JZvXVw' /, puidllVC LIaI15>p<Ji>di>C 

similar to pogo element 


7^?n 


JO 






rj-LHIlU aapiCIlb 


Ul lllctlUcU pi LHCIII piULlUCL 


1 97 


fO 


11Q 


AJjJO I 


XvalLUb 

norvegicus 




1 ^£ 


Jo 






rruxiio odpicilb 


v.'xsx i , 1V1JC/XV3 / , puiaLive tr cinsposase 
similar to pogo element 


i -*:a 

1 JO 


H-O 


141 


TT4QQ71 


JLJLLM I LVJ adpiciib 


v/xsjr i, iv i J3< r\j / y puiaLive Liaiibpubdbc 

<sim 1 1 ?ir to nriffn plpmptrt 
oiixiiicu iu ^jyjgyj t/idiiwiiL 


1 oz 


6^ 


142 


AF090930 


Homo sapiens 


PRO0478 


127 


65 


141 




JJLU11IV oopiCllo 


nuiiidii scuicLcu protcm, ocy liv inu. 
7871. 


141 


Jo 


144 


Y86472 


jnuixivJ adjpidio 


T-Tiim5iM cjf^nf^ ^s^frn f^t-r^ti^in 
rXLlIIlClIl £^C/Llv7 JZ."*CHL<ULICU pi ULC1I1 

fragment, SEQ ID NO:387. 




6^ 

OJ 


145 


AT9185R8 


Rpinni^t vii1o"afi^ 

OwlUl CIO V Llli^Cli lij 


^ jf tVJL/lll L/IliC L» UAlUdoC oUU LU11L JLJL1 


417 


79 

/ z 


146 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


139 


76 


147 


Y36156 


Homo sapiens 


Human secreted protein #28. 


91 


40 


148 


AF010144 


^—f r\~rt~t o can i c 

XlVJIIIVJ bdpiCIlD 


liCLil UHal LiJU cad pi ULC1H Jt\LJ f U~IN i x 


1 9 1 

IZ 1 


70 
/o 


149 


Y76184 


Homo sapiens 


Human secreted protein encoded by 

0f>.r\ p ^1 
w,wlC UI. 


214 


85 


150 


Y91577 


Homo <if»nipn«s 


T-Tumj^Ti QPPTptpH T*iT*rttf*in cpnuptipp 

J. J. LUll Oil SCU CLCU pi SJ LCI 11 oCLJLICIIL'U 

encoded by gene 2 SEQ ID NO:250. 




S7 


151 


G00417 


Hnmn canipnc 
XAUiiivJ adpiC/lio 


XlUlIlUll aCtl CLCLA pi WLC111 ? iZ>IZiV^ I LJ 1NU. 

4518. 


196 


66 


152 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


105 


51 


153 


AF1 19900 


Homo saoiens 


PR02822 


116 


62 


154 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


149 


66 


155 


AB009993 


Mus musculus 


collagen al(V) 


105 


36 


156 


U49973 


Homo saniens 


ORF1 * MER37* nutative transnosase 
similar to pogo element 


155 


69 


157 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


348 


71 


158 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


146 


54 


159 


AF247705 


Oryctolagus 
cuniculus 


alpha 1 type X collagen 


102 


42 



133 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


lOU 


P0501 3 


xionio sd.pi ens 


iNeurai mreau proxem. 


OO 


DO 


1 61 

101 




Homo sapiens 


Human breast cancer related protein 


1 19 
1 1Z 


JZ 














lOz 


A / 1 44z 


Rattus 


vJKr 1 3 putative 


70 


47 






norvegicus 








loJ 




Homo sapiens 


putative pi 50 


1 1 Q 
I 1 O 


3o 


1 /I 

104 


UzJ5 ID 


Caen orhab diti s 


weak similarity to adenylyl cyclase- 


93 


37 






elegans 1 


associated protein (CAP) and to P. 












cnaoauai aoami major merozoixe suriac 












anti crp>n r»rr»tp^in fPTT? * A 39 5^5^ Final 












exon overlaps gene predicted on other 












z>Ll all LI. 






165 


003963 

VJ \JJZ,\jJ 


nun 10 sapiciio 


JTlLXlllall SCL/1CLCU piUlClIl, u>JDV^ 1JL/ 1NU. 


1 43 


66 
oo 








7344 






166 


r\x 1 J\J\J / y 


ITOIIIU oapiCllo 


PR09859 

I XV \J Z. O J jL 


1 43 


on 


167 


T 97498 


T—T /"vrYt /"\ poni/inc 
JaUlIlO bapicllo 


reverse udnsenptase 


900 


5T 


loo 


OU4U /z 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


Vz 


Z>1 








81 53 

O 1 ^> J . 






160 


PQS01 3 


noiiiu Sapiens 


xNeiirai Lnredci protein. 


1 1 6 
1 I o 


54 


i /u 


V01 577 


nuniu sapiens 


xx uiiiaii seuieieu pruiem i>euueiiL«c 


394 










encooeu oy gene z ojdv^ iiv inw.zju. 






171 


TT83303 


XJ.CUI1U SdpiGllI> 


line - 1 ievciisc uaxisci lpia^e 


1 1 1 
111 


50 


179 

1 /Z 


003700 


xnomt) sapiens 


xi mil ail seureteu protein, om-jK^ iu inw. 


111 
111 


60 

027 








7871 

/ O / X . 






i / j 


AF1 30080 


rxunnj octpiciiD 


PRO? 5 50 


196 


50 

J27 


174 


S801 19 


P Qtt"i 1 c; en 


ICVClbC LI aiiodl 1U LaaC ilUllXUltJ^ 


1 5 1 


46 


175 


G02451 


X X wil l \J odpicilo 


tTnmfin QfT'rp'tpH T>rot<=»in QPO TO TxJO* 

XX 1X1 11 ail DCwlClCU. piVJlCXUj OXjVJJ 11_7 iiv/. 


1 1 3 
ii-? 


53 








6532. 






176 


U 15647 


IVTlJcl mil^CllllJ^ 


rftvpr^ft trati^rrrinta^ft 

IwWldU tl CUlOvlipiuOV 


104 


46 


177 


M24732 


Homo ^aniens 


lamin-lilcfi rvrotein 

ICilll L1JL IIIVvj yJ 1 \-f LV>1 1 1 


1 12 


42 


178 

I/O 


AF041 330 


DUUU ocil ItiH j 


"M A FiT-T rlohv/Htv'icroncioo QiiT^nnii" 5 
in^tlJuti uciiy ui ugciictoc duulhul j 


1 37 


<2» O 


179 


AF0 1 0 1 44 


T-Tomo Q?irWpn<i 

1 JL l_/ i 1 1 V/ OtXLJldllo 


npiirotiftl fhrpiid nrotf^in Arj7f x -TvTT'P 

IlC U I Ulial Ll 11 Call pi ULCUl ivlS 1 IN X XT 


143 


58 


JL OW 


AF104537 

/ VL 127*T_/.J/ 


Rnm A c n rst p>n Q 

XX vJ ill V/ DuUlCllD 


TvJAOl 3 


09 


00 

2?0 


181 


T I03564 


xxUiiiU sapiens 


j-f ofrii/ja r>1 
pUtLaLlVe piJl/ 


131 
iJi 


5^ 


1 89 


T T03574 


xiVimj Sapjcllb 


puiaiive puu 


86 
oO 


H-O 


1 83 


V141 66 


VJallLlS galluci 


aLLaL/llllieilt XCglvJIl UlULllllg piutciii 


01 


AO 


IOH 


003807 


ixomo Sapiens 


xiuniaii seLreieti protem, oxiv^ iu inv.^. 


1 04 


Of 








7888 






1 85 


T T4QQ73 


nomo sapiens 


vyivT i , iviE/ix j / , puiative transposase 




TX 








oinixiai isj pugu eiciiiciiL 






186 


U93572 


STLSJlllSj oClJUlCLlo 


vtiitati vp nl 50 

piXLCtLlVC UlJv 


130 


64 


187 


M99339 


iivxixivJ oapiciiD 


UlllvllUWXX pujicxn 


70 


41 

L T 1 


1 88 
1 oo 


V87909 
i o y zoz 


xiuino sapiens 


xi Liiii an seLreieu prutem setjuenL/e »r>i3\/ 


75 


71 
/ 1 








IT) "NTO-941 






189 


U70935 


Pprnm vcpi i c 

X Cl L/lil V oIjIIj 


ICVClJV Ll C4lloL/l l\j LatiC 


1 39 


37 






vn n n 1 r^i 1 1 atii q 

illClUlV^LilCXHXo 








100 


<n801 10 


XvallUS bp. 


reverse Lrdiisenptase noinoiog 


1 79 
1 /Z 


43 


101 

127 1 


AF1 04537 


Xioiiio sapiens 


"NTAOI 3 

lN/\vJ 1 J 


8 1 
51 


75 


109 


V45389 


XTivJiiiv.' i>apieiis> 


XJ7 im on oprrptpH v»T , /^i*oir> froornptif 
jnuiiiaii bccx etcLi pi<jLeixi xiagiixciit 


1 19 

1 1Z 


64 








primrfpH "frrvm crpnf* 98 






193 


X92485 


x laoixiVJUiuiii 


pval 


06 


40 






vivax 








194 


R13556 


Homo ^amen^ 


Protein encoded downstream of hhc TVT 


1 12 


45 








oncoprotein. 






195 


L27428 


Homo sapiens 


reverse transcriptase 


141 


37 


196 


U93570 


Homo sapiens 


putative pi 50 


201 


41 


197 


X92485 


Plasmodium 


pval 


120 


48 






vivax 








198 


AF130089 


Homo sapiens 


PRO2550 


137 


60 



134 



WO 01/64835 



PCT7US0 1/04927 



SFO ID 


Arrpccinn l\Ir\ 




Description 


onii ui- 


OS 

/o 


NO: 








Waterman 


Identit 










Score 


y 


199 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


377 


84 


200 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


119 


52 


201 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 


151 


68 








fragment, SEQ ID NO:490. 






202 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


247 


78 








similar to pogo element 






203 


W48351 


Homo sapiens 


Human breast cancer related protein 


126 


56 








BCRB2. 






204 


Y91577 


Homo sapiens 


Human secreted protein sequence 


362 


80 








encoded by gene 2 SEQ ID NO:250. 






205 


R59842 


Homo sapiens 


ApoE4Ll protease. 


100 


82 


206 


AF161356 


Homo sapiens 


HSPC093 


78 


62 


207 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


111 


48 








4581. 






208 


U83280 


Leishmania 


39 kDa antigen 


121 


53 






donovani 






209 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


80 








7795. 






210 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 


146 


72 








similar to pogo element 






211 


M69197 


Homo sapiens 


haptoglobin-related protein 


344 


92 


212 


AF034611 


Homo sapiens 


intrinsic factor-B12 receptor precursor; 


123 


37 








cubilin 






213 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


129 


65 








SEQ ID NO: 163. 






214 


V00662 


Homo sapiens 


cytochrome oxidase 1 


485 


87 


215 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


80 








6595. 






216 


U353 12 


Mus musculus 


nuclear receptor co-repressor 


115 


47 


217 


L26953 


Homo sapiens 


chromosomal protein 


143 


77 


213 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


224 


70 


219 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


75 








7534. 






220 


AB018114 


Arabidopsis 


RING finger protein-like 


111 


38 






thaliana 








221 


D38112 


Homo sapiens 


ATPase subunit 6 


475 


84 


222 


V00662 


Homo sapiens 1 


cytochrome B 


466 


77 


223 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


175 


85 








SEQ ID NO: 163. 






224 


AF010144 . 


Homo sapiens 


neuronal thread protein AD7c~NTP 


125 


38 


225 


Y02999 


Homo sapiens 


Fragment of human secreted protein 


86 


65 








encoded by gene 121. 






226 


X77816 


Rattus 


PR-Vbetal 


130 


54 






norvegicus 








227 


U09500 


Homo sapiens 


cytochrome b 


274 


62 


228 


AF081104 


Mus musculus 


ORF2 


111 


36 






domesticus 








229 


AF090942 


Homo sapiens 


PRO0657 


88 


57 


230 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


76 


57 








8172. 






231 


G03472 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


66 








7553. 






232 


B12310 


Homo sapiens 


Human secreted protein encoded by 


116 


54 








gene 10 clone HDPGP94. 






233 


AF0I0400 


Homo sapiens 


transaldolase-related protein i 


253 


77 


234 


AF197913 


Helicoverpa 


basic DNA-binding protein BDBP 


137 


50 






armigera nuclear 












polyhedrosis 












virus 









135 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 




Species 


Description 


Sin i tii- 


At 


NO: 








Waterman 


Identit 










Score 


y 


235 


AF090931 


Homo sapiens 


PRO0483 


123 


75 


236 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


185 


86 








similar to pogo element 






237 


Ml 9503 


Homo sapiens 


ORF1; putative 


99 


40 


238 


G0Q36S 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


93 


58 








4449. 






239 


AFO 14883 


Homo sapiens 


NADH dehydrogenase subunit 2 


305 


65 


240 


G03102 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


60 


44 








7183. 






241 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 


79 


50 








fragment, SEQ ID NO:387. 






242 


U15647 


Mus museums 


reverse transcriptase 


117 


47 


243 


AFO 10 144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


56 


244 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


45 








7214. 






245 


AK023542 


Homo sapiens 


unnamed protein product 


82 


38 


246 


X55702 


Drosophila 


polycomb protein 


84 


31 






melanogaster 








247 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


65 








7870. 






248 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


70 








6613. 






249 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 


142 


63 








IDNO:1051. 






250 


Y17832 


Human 


env protein 


297 


71 






endogenous 












retrovirus K 








251 


U93568 


Homo sapiens 


p40 


103 


46 


252 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


201 


47 


253 


AF090895 


Homo sapiens 


PRO01I7 


139 


60 


254 


Y02886 


Homo sapiens 


Fragment of human secreted protein 


185 


72 








encoded by gene 90. 






255 


L27428 


Homo sapiens 


reverse transcriptase 


156 


40 


256 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


157 


59 








similar to pogo element 






257 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


70 








4414. 






258 


AF194537 


Homo sapiens 


NAG 13 


141 


38 


259 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


115 


71 


260 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


53 








4718. 






261 


M22332 


Homo sapiens 


unknown protein 


78 


45 


262 


AF010144 


Homo sapiens 


neuronal thread protein AD7c~NTP 


70 


78 


263 


X61296 


Rattus 


open reading frame 2 


108 


36 






norvegicus 








264 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


178 


45 


265 


G03303 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


63 








7384. 






266 


Y01154 


Homo sapiens 


Protein sequence Seq Id 54 from 


116 


84 








WO9901020. 






267 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


118 


52 








7867. 






268 


AF1 19855 


Homo sapiens 


PRO 1847 


74 


70 


269 


AF 109907 


Homo sapiens 


S164 


85 


61 


270 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


63 








4414. 






271 


X92485 


Plasmodium 


pval 


107 


72 






vivax 








272 


AF1 94537 


Homo sapiens 


NAG 13 


167 


51 



136 
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cirri in 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


273 


U93564 


Homo sapiens 


p40 


104 


40 


274 


L27428 


Homo sapiens 


reverse transcriptase 


142 


56 


275 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


150 


60 








4435. 






276 


X61296 


Rattus 


open reading frame 2 


96 


48 






norvegicus 








277 


AF090931 


Homo sapiens 


PRO0483 


140 


65 


278 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


231 


66 








similar to pogo element 






279 


AF1300S9 


Homo sapiens 


PRO2550 


164 


60 


280 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 


98 


58 








protein 






281 


L22548 


Homo sapiens 


collagen type XVIII alpha 1 


92 


38 


282 


AP000616 


Oryza sativa 


similar to RING-1 12 finger protein 


130 


68 








RHAla(AF078683) 






283 


AF116715 


Homo sapiens 


PR02829 


160 


75 


284 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


111 


53 


285 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


53 








7871. 






286 


W48351 


Homo sapiens 


Human breast cancer related protein 


120 


53 








BCRB2. 






287 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


147 


66 








similar to pogo element 






288 


U93572 


Homo sapiens 


putative pi 50 


125 


32 


289 


AL050399 


Arabidopsis 


putative proline-rich protein 


142 


44 






thaliana 








290 


X92485 


Plasmodium 


pval 


147 


43 






vivax 








291 


AB047600 


Macaca 


hypothetical protein 


172 


66 






fascicularis 








292 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


108 


55 








7870. 






293 


D38115 


Pongo pygmaeus 


NADH dehydrogenase subunit 5 


342 


71 


294 


AF090942 


Homo sapiens 


PRO0657 


99 


66 


295 


M61185 


Bos taurus 


glutamic acid-rich protein 


114 


44 


296 


M13100 


Rattus 


unknown protein 


107 


43 






norvegicus 








297 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


114 


50 


298 


X92485 


Plasmodium 


pval 


93 


78 






vivax 








299 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


75 








7253. 






300 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


115 


42 


301 


L24521 


Homo sapiens 


transformation-related protein 


117 


60 


302 


X83413 


Human 


US 8 


219 


49 






herpesvirus 6 








303 


U93567 


Homo sapiens 


putative pi 50 


130 


48 


304 


B08918 


Homo sapiens 


Human secreted protein sequence 


72 


61 








encoded by gene 28 SEQ ID NO:75. 






305 


AJ242540 


Volvox carteri f. 


hydroxyproline-rich glycoprotein DZ- 


153 


68 






nagariensis 


HRGP 






306 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


532 


79 


307 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


84 


53 








7795. 






308 


L27428 


Homo sapiens 


reverse transcriptase 


151 


72 


309 


M69297 


Homo sapiens 


ORF3 


145 


43 


310 


X92485 


Plasmodium 


pval 


81 


60 






vivax 








311 


L27428 


Homo sapiens 


reverse transcriptase 


103 


41 



137 



WO 01/64835 



PCT7US0 1/04927 



crn I'D 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Id en tit 










Score 


y 


312 


AF130079 


Homo sapiens 


PR02852 


135 


49 


313 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


58 








7867. 






314 


AF090928 


Homo sapiens 


PRO0470 


88 


48 


315 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


175 


64 








4433. 






316 


U93568 


Homo sapiens 


putative pi 50 


148 


46 


317 


AF1 19855 


Homo sapiens 


PRO 1847 


1Q8 


84 


318 


P60839 


Homo sapiens 


Sequence of human serum albumin 


175 


50 








(HSA) on plasmid pXL53. 






319 


W46424 


Homo sapiens 


Human macrophage stimulating protein 


257 


69 








(MSP). 






320 


AL049547 


Homo sapiens 


dJ34F7.2 (CREB-RP (G13)) 


247 


64 


321 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


66 








6613. 






322 


AF0 16099 


Mus musculus 


endonuclease/reverse transcriptase 


102 


48 


323 


AF090930 


Homo sapiens 


PRO0478 


141 


72 


324 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


126 


44 








similar to pogo element 






325 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


59 






- 


4435. 






326 


R59842 


Homo sapiens 


ApoE4Ll protease. 


95 


60 


327 


W48351 


Homo sapiens 


Human breast cancer related protein 


84 


61 








BCRB2. 






328 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


123 


66 








4435. 






329 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


65 








4435. 






330 


AF1 19S5S 


Homo sapiens 


PRO! 847 


121 


80 


331 


L78671 


Homo sapiens 


CoxII/D-loop DNA fusion protein 


364 


71 


332 


AK000496 


Homo sapiens 


unnamed protein product 


145 


41 


333 


D00570 


Mus musculus 


open reading frame (196 AA) 


153 


53 


334 


AF1 19855 


Homo sapiens 


PR01847 


116 


74 


335 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


50 








7871. 






336 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 


129 


56 








oncoprotein. 






337 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


107 


59 








7075. 






338 


Y0115S 


Homo sapiens 


Secreted protein encoded by gene 1 8 


115 


72 








clone HCACJ81. 






339 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


117 


50 






brucei 


complete virion genome. 






340 


AK022217 


Homo sapiens 


unnamed protein product 


127 


70 


341 


U43360 


Peromyscus 


reverse transcriptase 


115 


75 






maniculatus 








342 


AF 11 8086 


Homo sapiens 


PR01992 


141 


73 


343 


X92485 


Plasmodium 


pval 


96 


59 






vivax 








344 


AF 106677 


Drosophila 


dissatisfaction 


90 


48 






melanogaster 








345 


U 12693 


Homo sapiens 


cytochrome oxidase subunit IT 


239 


91 


346 


L27428 


Homo sapiens 


reverse transcriptase 


95 


56 


347 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


69 








7795. 






348 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


81 


51 








6613. 






349 


X51616 


Volvox carteri 


SULFATED SURFACE 


110 


41 








GLYCOPROTEIN 185 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


350 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


158 


55 


351 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


183 


60 


352 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich, protein 


151 


51 


353 


R95913 


Homo sapiens 1 


Neural thread protein. 


95 


56 


354 


AF061340 


Artibeus 
jamaicensis 


cytochrome c oxidase subunit HI 


346 


70 


355 


AF090895 


Homo sapiens 


PRO0I17 


126 


60 


356 


AFO 16099 


Mus rnusculus 


endonuclease/reverse transcriptase 


121 


48 


357 


AF1 18086 


Homo sapiens 


PR01992 


159 


73 


358 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


85 


89 


359 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


98 


50 


360 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


110 


57 


361 


M13100 


Rattus 
norvegicus 


unknown protein 


122 


34 


362 


Y36203 


Homo sapiens 


Human secreted protein #75. 


108 


63 


363 

* 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


83 


75 


364 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


212 


65 


365 


AF 130051 


Homo sapiens 


PRO0898 


136 


71 


366 


AF068294 


Homo sapiens 


HDCMB45P 


188 


65 


367 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


225 


70 


368 


S80119 


Rattus sp. 


reverse transcriptase homolog 


188 


45 


369 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


75 


48 


370 


AF1 18082 


Homo sapiens 


PRO1902 


98 


79 


371 


AF010144 


Homo sapiens 


neuronal thread protein AD7c~NTP 


106 


40 


372 


AF014903 


Pan troglodytes 


NADH dehydrogenase subunit 2 


169 


41 


373 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


93 


48 


374 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 


66 


375 


G03107 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7188. 


90 


80 


376 


U93568 


Homo sapiens 


putative pi 50 


140 


56 


377 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


140 


50 


378 


AF090942 


Homo sapiens 


PRO0657 


154 


66 


379 


U93568 


Homo sapiens 


putative pi 50 


149 


36 


380 


U93570 


Homo sapiens 


p40 


184 


57 


381 


L27428 


Homo sapiens 


reverse transcriptase 


128 


60 


382 


AF 194537 


Homo sapiens 


NAG13 


114 


35 


383 


AF1 16712 


Homo sapiens 


PR02738 


109 


56 


384 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


405 


81 


385 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


116 


63 


386 


M22334 


Homo sapiens 


unknown protein 


124 


39 


387 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


97 


52 


388 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


327 


94 


389 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


157 


70 


390 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


129 


62 


391 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


259 


66 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








^A^a tf*rm o n 

'i <1 ICI ((Kill 


Irian tit 










Score 


y 


392 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


63 








4658. 






393 


U43360 


Peromyscus 


reverse transcriptase 


129 


54 






maniculatus 








394 


AK023582 


Homo sapiens 


unnamed protein product 


148 


46 


395 


M22332 


Homo sapiens 


unknown protein 


128 


41 


396 


AF1 18086 


Homo sapiens 


PRO 1992 


160 


71 


397 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


52 








6444. 






398 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


199 


66 


399 


U49973 a 


Homo sapiens 


ORF1; MER37; putative transposase 


234 


78 








similar to pogo element 






400 


AL3 59782 


Trypanosoma 


probable similar to ring-h2 finger 


107 


40 






brucei 


protein rhala. 






401 


AF000996 


Homo sapiens 


ubiquitous TPR motif, Y isoform 


116 


61 


402 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


145 


37 


403 


AF1 18082 


Homo sapiens 


PRO1902 


97 


55 


404 


AF202635 


Homo sapiens 


PP1200 


126 


55 


405 


V00662 


Homo sapiens 


cytochrome oxidase I 


352 


68 


406 


AF229067 


Homo sapiens 


PADI-H protein 


129 


71 


407 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


197 


38 






major 








408 


L26251 


Trypanosoma 


CR5 


95 


46 






brucei 








409 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


310 


62 








similar to pogo element 






410 


X92485 


Plasmodium 


pval 


96 


68 






vivax 








411 


M64793 


Rattus 


salivary proline-rich protein 


128 


40 






norvegicus 








412 


Y19192 


Talpa europaea 


cytochrome oxidase subunit I 


431 


83 


413 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


299 


86 


414 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


220 


81 








SEQ ID NO: 163. 






415 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


296 


89 


416 


X58438 


Mus museums 


proline rich protein 


112 


50 


417 


U70932 


Peromyscus 


reverse transcriptase 


89 


51 






leucopus 








418 


V00662 


Homo sapiens 


cytochrome oxidase III 


200 


84 


419 


AF017789 


Homo sapiens 


putative transcription factor CA150 


120 


41 


420 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


183 


69 


421 


AL3 59782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna ? 


166 


44 






brucei 


complete virion genome. 






422 


AF130051 


Homo sapiens 


PRO0898 


158 


59 


423 


R96800 


Homo sapiens 


Human histioeyte-secreted factor HSF. 


86 


52 


424 


D38116 


Pan paniscus 


cytochrome c oxidase subunit 3 


342 


82 


425 


U93570 


Homo sapiens 


putative pi 50 


133 


41 


426 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


165 


67 








SEQ ID NO: 163. 






427 


D13951 


Nicotiana 


extensin precursor 


140 


42 






tabacum 






428 


L27428 


Homo sapiens 


reverse transcriptase 


104 


34 


429 


R95913 


Homo sapiens 


Neural thread protein. 


118 


49 


430 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


206 


97 








SEQ ID NO: 163. 






431 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


88 


55 


432 


Y12713 


Mus musculus 


Pro-PoI-dUTPase polyprotein 


98 


54 


433 


J05042 


Oryctolagus 


alpha- 1 (VIII) collagen precursor 


91 


48 



140 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


v 






cunicuius 








434 


G03267 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


105 


56 








7348. 






435 


U93572 


Homo sapiens 


putative pi 50 


118 


38 


436 


U93569 


Homo sapiens 


putative pi 50 


100 


30 


437 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


81 








7437. 






438 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


166 


71 








6613, 






439 


U52077 


Homo sapiens 


mariner transposase 


187 


52 


440 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


80 


45 








7214. 






441 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


71 


442 


AE003727 


Drosophila 


CG 1 671 8 gene product 


301 


48 






m elanogas ter 








443 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


221 


74 








clone HPMBQ32. 






444 


U35730 


Mus musculus 


jerky 


159 


26 


445 


X53581 


Rattus 


ORF3 


192 


46 






norvegicus 








446 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


142 


52 








4497. 






447 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


156 


38 








similar to pogo element 






448 


AF 194537 


Homo sapiens 


NAG 13 


315 


70 


449 


W48351 


Homo sapiens 


Human breast cancer related nrotein 


93 










BCRB2. 






450 


X92099 


Brugia pahangi 


collagen 


126 


44 


451 


AF090930 


Homo sapiens 


PRO0478 


88 


60 


452 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


93 


40 








SEQIDNO:163. 






453 


AF081114 


Mus musculus 


ORF2 


108 


32 






domesticus 








454 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


173 


65 








787 1. 






455 


S80119 


Rattus sp. 


reverse transcriptase homolog 


197 


54 


456 


G02535 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


68 








6616. 






457 


R95913 


Homo sapiens 


Neural thread protein. 


114 


48 


458 


U09116 


Homo sapiens 


ORF1, encodes a 40 kDa product 


160 


39 


459 


X92485 


Plasmodium 


pval 


99 


52 






vivax 








460 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


100 


52 








8172. 






461 


S80119 


Rattus sp. 


reverse transcriptase homolog 


138 


48 


462 


Y91577 


Homo sapiens 


Human secreted protein sequence 


211 


67 








encoded by gene 2 SEQ ID NO:250. 






463 


X97707 


Pongo pygmaeus 


stopcodon created by 


229 


76 






abelii 


posttranscriptional polyadenylation 






464 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


119 


67 


465 


L27428 


Homo sapiens 


reverse transcriptase 


154 


40 


466 


AK000496 


Homo sapiens 


unnamed protein product 


140 


69 


467 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


114 


61 








4718. 






468 


G00589 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


146 


69 








4670. 






469 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


286 


79 


470 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


448 


86 


471 


M10546 


Homo sapiens 


cytochrome oxidase I 


296 


79 



141 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


472 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


102 


48 




AT 


LllClllCiliCl 


puLauvc snjtviNJr proiem 


1 u^> 


4Z 


474 


X99452 


Lycopersicon 

p<>f t 1 1 prt ti i m 


extensin-like protein Dif54 


108 


25 


475 


G03133 

V_I \J ~J JL -J *J 


Homo ^anipns 


Humeri iprrpfpH nrotpin ^T-i'O IT") "WO* 

7214. 


uo 




476 


ABO 12223 


Canis familiaris 


ORF2 


78 


66 


477 


AF 13 0089 


Homo saoiens 

JL X.\JXXX\J kJt^L/XVlikJ 


PRO2550 


113 


71 

f L 


478 


G03652 


1"Torno ^anipn^ 


Human Qprrptpd rvrotpin SlPO ID NO* 

7733. 




Q7 


479 


AF2 10651 


Homo sapiens 


NAG 18 


146 


80 


480 


AB029309 


Homo sapiens 


Npw38-binding protein NpwBP 


103 


40 


481 


AF 194537 


ffomn ^finipn*? 


NAG 13 


no 




482 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


115 


45 


483 


T 97478 




ICVClbC Li dlliL-i IjJ LcibC 


1 Rzl 


"=+ / 


484 


U93570 


Homo sapiens 


putative pi 50 


101 


50 


485 


AF1 04*517 

t\-\- L 7T-JJ / 


XTUII1U IscipiCIlb 


vrAr;i 1 

1N/-VVJ 1 _J> 


o 1 
Zl J> 


DZ 






x JLoriio sapiens 


jjuman sccreieu proxem JLaL^nrroo, 
SEQ ID NO: 163. 




i>Z 


"O / 


I OOZ'rO 


I— T/~\111 r\ convene 

noiiio sapiens 


xiuman secrexea protein oL^jnJrroo, 
SEQ ID NO: 163. 




oo 


HOC 




Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 






4550 


ttqi^74 


jtaomo sapiens 


putative pi 50 


50 




490 


W48351 


Homo sapiens 


Human breast cancer related protein 

DviVDZ. 


83 


56 


491 


AJ271872 


Nicotiana 

c\/l cfr*t c 


extensin 


220 


47 


4Q? 


TT1 1988 


1 1 t*^~\ C /*\ 1^ l"\ 1 1 d 

uruaupiiiid 
in p 1 fin n cr a pr 

-ill C-i dll vJ£J U5 tcu 


uiapriaiious proxeui 


1 1 j 




493 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

81 79 


184 


70 


494 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

770S 


131 


68 


495 


AF1 19900 


Homo sapiens 


PR02822 


148 


65 


406 




HniriA cjarxf on c 

xjluiiiv/ bapieiio 


vv a jr~ i dii 1 1 1 y proiein 




jo 


497 


D86853 


Catharanthus 

1 UoCUo 


extensin 


104 


34 


498 


AF09S467 


^ CtC 11 \Jl 11 CtULlillb 

elegans 


i**f\ty "fad 1 1^1 C C 1 1 T>"^ 1 Iq1"1 4*\ 7 "f /*\ nt^AP A'rtnil'!! Ti\T A 

^UULctlilb DUIllicti ILy l\J urUbOpillla JL/lN/\" 

binding protein K10 (NID:g8148) 


1 00 


4 / 


499 


GO 0497 


intjiiivJ bcipicitb 


li ui ii aii scLreLeu prutem, ojDv^ iju invJ. 
457^ 


1 AO 

ivy 


o / 


500 


G00416 


Homo <; aniens 


Human <ipct*ptpd r^rofpin 9FO TD TvTO* 
4497. 






501 


AF1 19901 


WotT>fl QftTlipffQ 
x jujiiivj aajpiciio 




1 1 u 


87 
oZ 


502 


AF238235 


T* vii 3m npKa 

l_fl 1 id* 1 1 & 1/ cl 

histolytica 






41 

H i 


503 


M22332 


Homo ^aniprtR 


nnVnown nrotp-in 

UillJVlIVJ W 11 Lslvrlwlll 


17^ 




504 


AF119851 


Homo sapiens 


PRO 1722 


204 


52 


SOS 


X61796 


JCvallUb 

norvegicus 


open reaumg irame z 


1 0^7 




506 


AF 11 8082 


Homo sapiens 


PRO 1 902 


145 


52 


507 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


103 


68 


508 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


175 


39 


509 


X92485 


Plasmodium 


pval 


117 


52 



142 



WO 01/64835 



PCT/USO 1/04927 



SEQID 
NO: 


Accession No. 


Species 
— _ 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






VIVaX 








510 

Jiv 


AF0QQ949 


rlOinO bapicilb 


PT?r\A^5T 
xrUUDj / 


yj 


I 1 


51 1 




xioino sapiens 


puiative pi9U 


1 9 A 


j4 


512 


U93574 


Homo sapiens 


putative pi 50 


140 


49 


513 


A T949540 


voivoa Careen i. 
nagariensis 


hydroxypro line-rich glycoprotein DZ- 
HRGP 


i 70 


OD 


514 
Jit 


T 9749 ft 


jtiomo sapiens 


reverse transcriptase 






j l j 


VJ 7JJUJ 


rioiiio Sapiens 


i^n to ■Hires r-*1 KCl 

puidiive pi3U 


1 A 1 


40 


516 


TTQ3574 


noino sapiens 


puiduve piz>u 




J J 


517 


G02753 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

Oo34. 


81 


27 


518 


AF053538 


Alvinella 
pompejana 


fibrillar collagen chain FApl alpha 


112 


36 


51 Q 


V599'3^ 


Homo sapiens 




14a 


35 


520 


AF130051 


Homo sapiens 


PRO0898 


98 


61 


JZ 1 


T AO 1 A£ 


Drosophila 
melanogaster 


ribonucleoprotein 


143 


40 


522 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
Jo Jo. 


202 


70 


523 


U93570 


Homo sapiens 


putative pi 50 


159 


43 




y oox4& 


Homo sapiens 


Human secreted protein HCHPF68, 
obQ ID NO:lo3. 


210 


100 






Homo sapiens 


reverse transcriptase 


128 


38 




nn 1 9/i £ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


96 


65 


^97 




Kattus 
norvegicus 


OKr4 


130 


42 


59 ft 


T TQQ <I7A 


Homo sapiens 


putative pi 50 


195 


35 


59 Q 


AT71 "5Aft5Q 


Homo sapiens 




132 


43 


530 


AK024455 


Homo sapiens 


FLJ00047 protein 


126 


54 j 


03 1 


vjrlL£o3y 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


123 


61 






Homo sapiens 


OKF 1 ; MJbR37; putative transposase 
similar to pogo element 


210 


44 




A I7A 1 A 1 ylyl 


Homo sapiens 


neuronal thread protein AD7c-NTP 


109 


42 


534 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


120 


62 




Mo 4 / y3 


Rattus 
norvegicus 


salivary proline-rich protein 


124 


37 


536 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


122 


50 


537 




— — ; 

Homo sapiens 


Human secreted protein, SEQ ID NO: 

471 ft 


118 


4^> 


538 


A23786 


Beta vulgaris 


chitinase 1 


91 


33 


530 


AP01 01 44 


noino sapiens 


neuronal tnreau protein au / c-iN 1 Jr 


1 ir 
1 10 


jy 


540 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

Qt 70 

iSl JZ. 


102 


67 


541 


S80119 


Rattus sp. 


reverse transcriptase homoiog 


191 


50 






Homo sapiens 


Human histiocyte-secreted factor HSF. 


155 


77 


543 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
DyKJy. 


125 


91 




W453M 


— — ; 

Homo sapiens 


Human breast cancer related protein 
RCR R9 


146 


62 


545 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


74 


45 


546 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HCACJ81. 


94 


75 


547 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


510 


86 


548 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 


146 


68 



143 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








RHAla(AF078683) 






549 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


128 


82 


550 


B03628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


153 


77 


551 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7077. 


121 


56 


552 


X92485 


Plasmodium 
vivax 


pval 


103 


50 


553 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


129 


46 


554 


L27428 


Homo sapiens 


reverse transcriptase 


149 


44 


555 


AF 194537 


Homo sapiens 


NAG 13 


157 


45 


556 


Y13247 


Homo sapiens 


FBI 9 protein 


106 


42 


557 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


100 


54 


558 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


176 


68 


559 


S80119 


Rattus sp. 


reverse transcriptase homolog 


113 


43 


560 


AY008270 


Homo sapiens 


cholestery 1 ester transfer protein 


107 


95 


561 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


140 


63 


562 


U49973 


Homo sapiens 


ORF 1 ; MER37; putative transposase 
similar to pogo element 


347 


68 


563 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


93 


40 


564 


D38114 


Gorilla gorilla 


cytochrome c oxidase subunit 3 (COIII) 


329 


74 


565 


Y36156 


Homo sapiens 


Human secreted protein #28. 


153 


56 


566 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


131 


48 


567 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


406 


94 


568 


AF130079 


Homo sapiens 


PR02852 


101 


55 


569 


AF081114 


Mus musculus 
domesticus 


ORF2 


123 


40 


570 


L22030 


Glycine max 


hydroxyproline-rich glycoprotein 


65 


45 


571 


D86853 


Catharanthus 
roseus 


extensin 


168 


39 


572 


AF1 04415 


Mus musculus 


gene trap locus- 13 


179 


66 


573 


AF 130089 


Homo sapiens 


PRO2550 


114 


56 


574 


X67863 


Mus musculus 


T2 


115 


42 


575 


S80119 


Rattus sp. 


reverse transcriptase homolog 


101 


28 


576 


S80119 


Rattus sp. 


reverse transcriptase homolog 


150 


57 


577 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


142 


74 


578 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


106 


57 


579 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


46 


580 


L24521 


Homo sapiens 


transformation-related protein 


110 


38 


581 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


537 


84 


582 


AF090895 


Homo sapiens 


PRO0117 


127 


80 


583 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


145 j 


70 


584 


X55681 


Lycopersicon 
esculentum 


extensin (class I) 


112 


38 


585 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


473 


60 


586 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. ! 


144 


68 


587 


U47855 


Araneus 
diadematus 


fibroin-3 


124 


30 


588 


AFO 10 144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


150 


75 


589 


U93567 


Homo sapiens 


putative pi 50 


225 


47 



144 



WO 01/64835 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 


% 

Idcntit 

y 


590 


X71602 


Nicotiana 
tabacum 


extensin 


147 


33 


591 


X57527 


Homo sapiens 


alpha l(VIII) collagen 


103 


42 


592 


G031 12 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7193. 


75 


47 


593 


R28916 


Homo sapiens 


Type III procollagen (prior art). 


116 


48 


594 


R95913 


Homo sapiens 


Neural thread protein. 


116 


37 


595 


U11880 


Petromyzon 
marinus 


cytochrome oxidase subunit I 


127 


52 


596 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


84 


62 


597 


L27428 


Homo sapiens 


reverse transcriptase 


158 


40 


598 


M55251 


Canis familiaris 


fflvconrotein 80 


559 


86 


599 


AP000616 


Oryza sativa 


similar to RrNCr-H'? finder nrotein 
RHAla (AF078683) 


132 


100 


600 


Ml 8094 


Phaseolus 
vulgaris 


hvdroxvnroline-rich crlvconrotein 


143 


33 


601 


G03411 


Homo sapiens 


Human secreted nrotein SEO TD NO* 
7492. 


154 


54 


602 


X73481 


Drosonhila hvHei 




107 


47 


603 


M81321 


Macaca 

ffi Qr*i pi 1 1 nri q 


proline-rich protein 


114 


39 


604 


X05561 


Homo sapiens 


alpha- 1 chain precursor (A A -27 to 
91 7 1 is ?nd base in rodorA 


109 


42 


605 


G03203 




Hi lm n n Qf^prp'tprl m'otp in QPD TO TvIO' 

1XU.111CH1 JvUl C LC Vi Lfl KJ l\Z 111 j OJ_!/V^ XX-J IN 

7284. 


ft? 




606 


G03203 


T-Tnmn <sarviens 


l-himan cieereterl nrotein *sF O TD HMO* 

X X\XxXX(Xi.X JV^l ulvU LJl\JL\jlLXy k?Jt_*V^ XX-J H V. 

7284. 


83 


64 


607 


AL3 59782 


Trvnannsoma 

brucei 


nrnhah]f» similar tn rtn cr-li 7 firtcrpr 

protein rhala. 


1 07 

i \j i 


44 


608 


L27428 


Homo sapiens 


reverse transcriptase 


147 


43 


609 


G00357 


ffomn sarviens 

X X\JXXX\J OClJJiVllO 


Human secreted nrotein ^PO ID T\fO* 

X JL Hill till jwl vluu Lfi \J Lb 111 j k_7 1 AX-/ 1 \ V_-/ . 

4438. 


1 13 


61 

VJ x 


610 


Y86248 


Homo sarviens 

XX\JXXX\J OUUJIVUJ 


Human secreted nrotein HCT-TPP6S 

XXlAXXXtXll iJWlVlvU Ul VfLV^lll. 11V/1 11 X UUj 

SEQ ID NO: 163. 


151 


82 


611 


U93568 


Homo saiTiens 


nutative nl SO 


144 


32 


612 


AB022223 


Arabidopsis 
thaliana 


QxtQnsin protein-like 


186 


58 


613 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


108 


49 


614 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


133 


36 


615 


X92485 


Plasmodium 
vivax 


pval 


120 


42 


616 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


175 


89 


617 


U83280 


Leishmania 
donovani 


39 kDa antigen 


111 


51 


618 


AL 160493 


L eishmania 
major 


nrobable fhhv-6^ nil 02 variant a 
DNAj complete virion genome 


137 


67 


619 


G02532 


Homo sapiens 


Human secreted protein 3 SEQ ID NO: 
6613. 


120 


53 


620 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


92 


45 


621 


AF130089 


Homo sapiens 


PRO2550 


123 


34 


622 


AF0 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


133 


59 


623 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


127 


45 


624 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


205 


66 



145 



WO 01/64835 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


625 


X07882 


T-Tomo Qarvif^nQ 


i \J ]Jl KJ LCI 1.1 


1 19 




626 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SFO TO NO- 163 


70 


100 


627 


G00637 




Xluilldll oCl^ICLCU ^JlUtCIllj JLv\/ JJ-V JlMV-'. 

4718. 


141 


D I 


628 


G00637 


Homo sapiens 


Human secreted protein SEQ ID NO: 
4718. 


141 


54 


629 


G03786 


Homo sanien^ 


Human sprrpted nrrvtain SVRO TP) NO* 
7867. 


115 




630 


X6336S 


Homo sapiens 


HSJlb 


151 


52 


633 


AF130089 


Homo sapiens 


PRO2550 


155 


47 


632 


X92485 


Pla^m nrHnm 

vivax 


pval 


1 0? 


61 
t> i 


633 


K03205 


Homo ^anien^ 


^plivflrv rvroli np*-r if^H nrotein nrpnir<;nr 


102 




634 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 

cim j Jar to nncrn pi pnipnt 


171 


74 


635 


X92485 


Plasmodium 
vivax 


pval 


95 


73 


636 


S80119 


1VUILUJ }~ " 


tpver?p tr?ir»RpriT>+?i"?p Vinmolncr 


1 14 


JO 


637 


U15647 


Mns musculus 


reverse transcriptase 


170 


42 


638 




xx\JHl\J cxXjJlClib 


xiuiiiau accrcicu protein, ocy lu inv-/. 

7124. 


1 ^0 


if,. 




G00637 


xjluiiiu aapicns 


Wnman cf^fria+f^rl nrn+pin QCA TT~S "NT/~\- 

nuiiiaii occrcica. proxcm, ocv,/ lu inv-J. 
4718. 


OJ 


DO 


640 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


96 


34 


641 


X61296 


Rattus 


open reading frame 2 


166 


35 


642 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

443 3 


127 


35 






^acnorndDuiiis 


similar to myoblast cell surface antigen 
fcp.pcoA TTTTMATM P03946 > i flT iH D 

melanogaster No-on-transient A protein 
(PTR-TH016TI 

^JL A J. V.J A i.U J KJsL. J 


1 1 c 
1 lJ 


i < 


644 


AF081 111 


A/Tii^ mi iQfi iliw 

dome sticus 


ORF? 

V/J.V1 


1 ftR 
1 uo 




645 


AK027208 


Homo sapiens 


unnamed nrotein nrodu ct 


90 


51 


646 


AF0 16099 


Mus musculus 


endonuclease/reverse transcriptase 


101 


59 


647 


Y86248 


x x\jin.\j japi^iio 


Unman <?prrf>teH nrrvf-ein I-TPTTPT76R 

JHLH.lJLu.il oCUlOLCLi JLilVJLClll xx\-*xxx C t/O, 

SEQ ID NO: 163. 


it/ 


76 
/ o 


648 


AF273441 


Pongo pygmaeus 


NADH dehydrogenase subunit 3 


121 


58 


649 


L27428 


Homo saTjieim 

X XyJXXXVJ 3 CI L_/ JLvllO 


reverse tran^crintacip 


173 




650 


AF119851 


Homo saniens 


PRO 1722 

X iVV_/ A / f - 


176 




651 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded bv ffene 2 SRO TD NO* 1 25 


168 


68 


652 


AF130089 


Homo sapiens 


PRO2550 


130 


36 


653 


W50192 


Homo sapiens 


Amino acid seauence of salivarv 
protein CON-1. 


126 


41 


654 


AK000385 


Homo sapiens 


unnamed nrotein rvroduct 


195 


63 


655 


AB041881 


Rattus 
norvegicus 


cytoplasmic dynein heavy chain 


158 


100 


656 


X61047 


Hydra sp. 


mini-collagen 


60 


38 


657 


M22332 


Homo sapiens 


unknown nrotein 


100 


50 


658 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


143 


62 


659 


AF1 94537 


Homo sapiens 


NAG13 


95 


48 


660 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


120 


50 


661 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


86 


32 


662 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


169 


53 


663 


W48351 


Homo sapiens 


Human breast cancer related protein 


120 


72 



146 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








r3UKJb>Z. 






ot>4- 


U 1364 / 


Mus musculus 


reverse transcriptase 


1 A £ 
145 


50 


665 




Homo sapiens 


Neural thread protein. 


lol 


59 


666 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/UU1 . 


134 


80 


667 


G03267 


Homo sapiens 
— — 


Human secreted protein, SEQ ID NO: 

/545. 


117 


44 


005 


A\oU 15/ U 3 


Mus musculus 




1 1 c 

1 13 


5Z 


669 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


280 


75 


670 


A5r>5ol 


Rattus 
norvegicus 


OKr4 


71 


39 


671 


U49973 


Homo sapiens 


ORF 1 ; MER3 7; putative transposase 
similar to pogo element 


201 


66 


6/2 




Homo sapiens 


Neural thread protein. 


1 A A 

144 


78 


6/3 


T T AC\C\H^ 

U499/3 


Homo sapiens 


OKr 1 ; MER37; putative transposase 
similar to pogo element 


164 


46 


674 


A T? 1 1 OAOO 

Ar 1 15082 


Homo sapiens 


FRO 19 02 


137 


49 


675 


Cj031j3 


Homo sapiens 


Human secreted protein, bEQ ID NO: 
/Zl4. 


91 


A "J 

43 


o/o 


jJuuj /u 


Mus musculus 


open reading frame (25 1 A A) 


1 io 

1 iZ 


f z 


Off 


ACT O/f f ? 7 

Ar iy43J / 


Homo sapiens 


TVT A S~ 1 t 1 

JNACjli 


ZOO 


56 


o/o 




Kattus 
norvegicus 


unknown protein 


146 


5 1 


o/y 


U 15647 


Mus musculus 


reverse transcriptase 


1 OO 

123 


£ A 

54 


/' r> pv 


Ky^y i j 


Homo sapiens 


Neural thread protein. 


143 


53 


681 


R59842 


Homo sapiens 


ApoE4Ll protease. 


107 


63 


6oz 




Homo sapiens 


PKO2550 


94 


5 1 


683 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


129 


69 


684 


AF093748 


Homo sapiens 


KH type sphcmg regulatory protein; 
KSRP 


93 


50 


685 


U93569 


Homo sapiens 


putative pi 50 


1 J>J) 


58 


686 


G03095 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7176. 


137 


64 


687 


Y36243 


Homo sapiens 


Human secreted protein encoded by 
gene 20. 


69 


73 


655 


A "C? 1 1 £H 1 O 

Ar 1 1 6 / 1 Z 


Homo sapiens 


T>r> r\T77 o 
PK02735 


loo 


56 






Homo sapiens 


Human secretea protem, £>EQ ID NO: 


lj)4 


53 






— — : 

Homo sapiens 


putative pi 50 


1 JZ 


4V 


691 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


349 


70 


oyz 




— — - : 

Homo sapiens 


t>t> /"\f \ i in 


1 13 


63 




J\r UUU07 


Homo sapiens 


riSXJZDJKJ 


1 

1 JZ 


oil 


£0/1 


QOA1 1 O 


Rattus sp. 


reverse transcriptase homolog 


1 ri i 
lU 1 


4o 


695 


U 15647 


Mus musculus 


reverse transcriptase 


120 


64 




KjV3 /59 


Homo sapiens 


Human secretea protein, £>EQ ID NO: 

/o/U. 


1 


CA 

59 


697 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


128 


72 




A "C A Mil A A 

ATU1U144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


loz 


An 
4 / 




Looi \Z 


Homo sapiens 


cytochrome c oxidase subunit 1 


459 


no 


/□(J 


UUU4U3 


Homo sapiens 


Human secretea protem, oEQ ID NO: 

A AO A 


1 AO 

145 


73 


701 


AF003535 


Homo sapiens 


ORF2-like protem 


125 


49 


702 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g814S) 


89 


41 


703 


L27428 


Homo sapiens 


reverse transcriptase 


255 


50 


704 


AF130089 


Homo sapiens 


PRO2550 


87 


55 


705 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


113 


52 



147 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








7133. 






70/^ 
/uo 


V7Q1 A(\ 


Homo sapiens 


Human haemopoietic stem cell 
regulatory protein SCM3. 


O 1 1 
Z 1 I 


88 


707 


T T1 5647 


iyius muscmus 


reverse transcriptase 


OA 


4 / 


708 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone ri 1 ij/\j_jzz. 


153 


66 


700 


\J\JD<y KJ / 


nonio sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


LU/ 


CA 


710 


TTAQQ73 


norno sapiens 


L/jtvr i , ivir,Kj i , putative transposase 
similar to pogo element 


1 OJ 


£LA 

64 


71 1 


H03714 


numu SdpicilS 


xiuinan secreieu protein, oJtiv^ ilj invJ. 
7795. 


1 APk 
140 




71? 


TT03565 


JTUIIIU Sd.JJlCllb 


pULatlVc piZ>U 


1 nsz 

1 Uo 


zo 


713 


H031 14. 


XaUllHJ bapiciib 


Tinman C"/a>r»-r»<s>+oi/~l v\ ilf* Q IZ? f\ TT\ "MY"^* 

jiuman secrexea protein, oDy ijl' anvj. 
7195. 


1 OQ 
1 Uo 


Z) / 


714 


1VT9473? 


riorno sapiens 


lamin-like protein 




Q< 
DD 


715 


D3&1 1 ? 

JL/_J Ol 1Z 


nonio sapiens 


cytocnrome c oxiuase suounn d 




7Q 


716 


U97674 


Mesocricetus 
auraius 


cytochrome c oxidase chain I 


462 


85 


717 


A T7A0A71 5 
/\r UUh- / 1 j 


Homo sapiens 


— — . _ _ j 

jerky gene product homo log 


1 (\c\ 
too 


4z 


718 


X92485 


Plasmodium 
vivax 


pval 


84 


48 


719 


AF130089 


Homo sapiens 


PRO2550 


132 


74 


/ZU 


AT ylCl An 

AL45 1 U I / 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


125 


43 


T") 1 

/Zl 




Homo sapiens 


rLJUUU4/ protein 


1 AO 

108 


68 


722 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

11 14. 


146 


46 


723 


X83413 


Human 
herpesvirus 6 


U88 


269 


41 


724 


X92485 


Plasmodium 
vivax 


pval 


117 


43 


77 <s 
/Zj 


VQO/1525 


Plasmodium 
vivax 




pval 


97 


A 1 

41 


77 A 
/ZO 


no7^3°. 


Homo sapiens 


Human secreted protein, b.bQ 1L> NU: 
6619. 


lOz 


Id 


/Z / 


LjUUOJ / 


Homo sapiens 


Human secreted protein, bJbQ ID 1NU: 
4718. 


1 1 O 

118 


43 


70R 
/Zo 


AZJ / oO 


Beta vulgaris 


chitinase 1 


A 1 

91 


33 


729 


W48351 


Homo sapiens 


Human breast cancer related protein 


107 


49 


730 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/ ivz>. 


131 


59 


731 


WzlQ7 1 7 


xiomo sapiens 


Protein polymer adhesive substrate 
PPAS1-C. 


1 A Q 


OA 

zy 


739 


W501 03 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 




45 


733 

I DD 


VQ/^73 1 

2\y 0 / ^> 1 


Ostertagia 
circumcincta 


cuticular collagen 


1 A/l 

104 


37 


1 JH 


AF1 300RQ 


Homo sapiens 


PT? rv3 5^o 


1 1 Q 


>1 A 
40 


735 


H07755 


riomo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


loo 


1 A A 
100 


736 


Y91577 




in.U-lLia.LX bCUCLCU ^JlUlCill bCL£UCllUC 

encoded by gene 2 SEQ ID NO:250. 




Sol 

oO 


737 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


188 


69 


738 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


593 


91 


739 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


102 


62 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


740 






XlUIIlail aeUlClCCl UrUlClIl, OjL/V^ ILJ 1N\_/. 

7284. 


1 1 1 
ill 


J4 


74] 


A23786 


Rf^ta vulcraric 


till cii>C X 


1 Oft 


■^A 
jO 


742 


G00354 




T-T » j rn .<} r» cw^^fv^rl nt-r^fv^ir* Q (~\ TT~V 7\JO* 
XJ.U111C11I oCl/£ClCli \jL\Jl\jlLly OJ-»V«/ INL/. 

4435. 


71 


79 


f-T-> 




T— f r^t"n r\ canipnc 
XiUIJUJ odpiCIlb 


L^Jtvr 1, IVliiJKj /, puicLLlvG irallbpObaoC 
Qimilflr 1~r\ r*nor> plf^mpnt 


1 R4 


oi 


744 


AE003629 


Drosophila 
melanojjpi^tpr 


CG17108 gene product 


76 


36 


745 


U93563 


Homo sapiens 


putative pi 50 


145 


50 


746 


VJU 1 7J 1 


T-T/~vmr\ c oni f*v\ c 
JiUlIIU iJaplClls 


nuiiiaii Ssccrcicu proLcin, any iu inw. 
6012. 


Oo 




747 


AF91 797*3 


T-Trvtn r\ ccimpnc 
JTLUlllU ocipiCllo 


UI1M1UW 1 1 


1 1 j , 


7Q 


748 


AC006161 


Arabidopsis 

Llldllalla 


putative CENP-B/ARS-binding 
protcm~iiKe piOLcin 


112 


31 


749 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7077 


106 


47 


750 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 

xvrlAia ^/VrU /oOcoJ 


115 


54 


7S1 


no9ziRO 

UUZ'+OU 


nomo sapiens 


xiumaii secrereu protein, oJC/^ ijl^ in la 
6561. 


no 


fZ 


7S9 




jnLonio sapiens 


riumaii secrexea proiem, oxi^ iu invj. 






753 




r iciii ii uu nil 1 1 

vivax 






7^ 


7S4 


U93563 


T-Tr\mr» c^m'pnc 




1 Rft 

1 OL) 


Do 


7SS 


AP00061 6 




biiiiiiar to iviiN vj-riz, linger proicin 
RHAla(AF078683) 


1 1 6 
1 1 o 


^O 


756 


AF1 Q4S37 


T-Trvfv>f\ oofMPnc 

jriuiiitj bdpiciib 


lN/\0 I -J 


1 

l^o 


40 


757 


AFOI 01 44 


JC1U1 11 U 0<3JJ±011Z> 


iicui Uiial Uli Call pi ULCiii r\i_v / u-iN JL Jr 


119 
1 S.Z. 


70 


758 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

707S 


119 


61 


759 


AF1 30079 


T- Ffvm f\ cnni c 
XlUHlvJ odpiCllb 




1 

IjO 


40 


760 


X92485 


Plasmodium 

VlVdA 


pval 


88 


77 


761 


U93050 


A/fnc mil cr^nlnc 


jJ\Jiy\r\J UillUJLilg plUlClll 11 






762 


V01555 


JTLlllllClll 

rifyrnf^vim 5 ! 4 

Jt.i.V<J. ywiJ V 11 LtiJ T 


RVRP1 p^n^rvH^t; PRATA-7 ^Fiamhaiicrf* 
et al 1984* Dillner et al \<)R4, 

Cllj 1 ^ (JT, XSXlXlX\jl L Cll, _L I / 


£0 


41 


763 


V00662 


Homo ^anien^i 


ATPa^e 6 


337 


83 


764 


X62677 


O rv c t o 1 a !? u 

cuniculus 




1 17 


41 

r 1 


765 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


423 


81 


766 


R95913 


Homo sapiens 


"Wfiural fhreaH Drnfpin 


114 


66 


767 


V00662 


Homo sapiens 


cvtoclirnTYie oxidfi^e T 


223 


83 


768 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


268 


83 


769 


V00662 


T-Frnnrv <??inipn 


L/_y LUUU U'ill L/AJUaot 1 


^7 


R1 


770 


D381 12 


Homo <3anien<i 


7\I A ni-T rlf^hvrlmCTf^naQp cuTMinif 4 

-IN -£i-J — ' X JL UCllj/Ul UgCllttoC U LI 111 L ^+ 




71 
/ 1 


771 


AF026211 


f^apnorTifibrlitiQ 

elegans 


Slimilar tn mtif-n1?ir rrtllao-fn 






772 


A K 07445 S 


rTfYTYlO CflTMPtl? 


FT TO 0047 Tvrrktf* in 


1 OR 
1 uo 


S^ 


773 




JT IdblllU Kll UJULi 

vivax 


pval 




SO 
J7 


774 


AF130051 


Homo sapiens 


PRO089S 


123 


38 


775 


ABO 12223 


Canis familiaris 


ORF2 


174 


51 


776 


AB028664 


Paralichthys 
olivaceus 


cytochrome oxidase subunit-3 


268 


57 


777 


V00662 


Homo sapiens 


cytochrome oxidase I 


436 


85 


778 


X52235 


Homo sapiens 


ORFII 


125 


47 


779 


U93569 


Homo sapiens 


putative pi 50 


235 


49 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


7R0 




T— Trvm rv com on c 

XxOHlO bdpiciib 


OT?Fl* AyTT7T3 1"7 • mifo+n/o trarionAcciop 
v-Jxvr l, l.VlJz / Jtvj> / 3 puialive LrdIlbpObd.be 

similar to pogo element 


9 £9 


/o 


/ O 1 




liVUi <J bctJL/iClIb 


ojvr i, ivixirvJ /, putative udiibpobabe 
similar to pogo element 


9^0 


/U 


7R9 

/ Ox. 


AF0Q0Q49 


mOillU bdpICJIb 


pt?no£S7 

x XvUWJ / 




oy 


783 


G00454 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4S^5 


85 


66 


7R4 


T T4QQ7^ 


XlVJlllL' bajJlCilb 


V^XVT 1 5 IVllJ/XVJ / 3 pucaLivc LrailbpObaSG 
similar tc\ nncro plpmpnt 

iJ.ILli.l ltU l-VJ JJUg^yj Ol^'llll^l J L 


9HQ 


^9 


785 


U49973 


Homo sapiens 


ORF1; MEE137; putative transposase 

cirnil?ir tr» nnrrn p>! pmpnt 


184 


60 


786 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 

tiimilar tc\ r\r\ctn pip m ^ n t 


158 


60 


787 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


293 


86 


i do 




xiomo sapiens 


rAUZjJU 


1 J> 3 




7RQ 


UUjH 1 1 


nomo sapiens 


xiuman secrexea proiem, ori^/ ii^ xnvj. 
7492. 


1 ^ 1 


<:7 


7on 


vjrUU.? J4 


xiomo sapiens 


xiuman secretea protein, ojbv,/ 119 iN\9. 


1 UU 


M 




O09QR7 

VjuZ70 / 


rxunio sdpiens 


xxuriidn bcureicci protein, oriv,; 119 xnw. 
/ ooo. 


11^ 
1 1 o 




792 


Y36203 


XiL'iil v7 bctlJiCilb 


Xxuiiictii aCOiClCvi j^lv/lciil ft / J . 


i rid 


77 


793 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 

ISUlUl 111 


117 


58 


794 


AF090930 
rvr \jy\jyD vj 


xxlhiiu ocipiciib 


PRO0478 




oo 


795 


AK024455 


Homo sapiens 


FLJ00047 protein 


114 


61 


796 


AF01 0144 


T-Frvmo ciinipnc 
JL IwIilL/ bdjpiciib 


llCUlOlldl LIU CdU piUtCUl r\LJ /L>-L\ 1 IT 


1*+1 


70 


191 


AF 130089 


Homo sapiens 


PRO2550 


273 


85 


7Q£ 


w oooz / 


nomo sapiens 


Secreted protein encoded by gene 94 

rlnn/=» HPMR0^9 
U/iuiic xxir ±yiji>\£~> z, . 




DO 


199 


AFI 3 0051 


Homo sapiens 


PRO0898 


162 


80 


0\J\J 




XXLHIIU bdJJlCUb 


XI LU 11 all bCUrclcCi piOlClll, OCy iN*wf. 

7641. 


1 TO 
1 ly 


/U 




I 1 H*f / ¥ 


xiomo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


/ _> 


^ 1 




A FOA1 ^O 


Bo do saltans 


]\/\i9ri aenyaiogenase suounit d 


1 ni 


34 




JL/3 o^o^f 


riyiooaies 
syndactylus 


Cytochrome C oxidase subunit 1 
(COXI) 




/U 


ROA 


rioo^^7 


raomo Sapiens 


xiuman secretea protein, oriv^ 1J9 inv9. 
4718. 


1 1 7 
1 1Z 


/1 1 
41 


o v-j 




JridSIIlOUlUIIi 

vivax 


pval 




JO 


806 


AFI Q4S ^7 


noiiiu bajJicHb 




9R^ 


J 1 


807 


AF121360 


Drosophila 


DNZDHHC/NEW1 zinc finger protein 
1 1 


179 


47 


808 


A7-6tOJ 


x laolllUUluill 

vivax 


pval 


ij> i 




809 


D13951 


Nicotiana 


extensin precursor 


88 


41 


810 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 

uiuiic i li n ix± y^fz^y . 


174 


73 


811 


LJ 7jJvj 


xxmiiu bdpioiib 


pu.La.Livc pi 


lis 

1 1 o 


JO 


812 


AFI 18082 


Homo sapiens 


PRO 1902 


88 


54 


813 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


132 


42 


814 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhal a. 


114 


50 


815 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyproteui 


113 


43 


816 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


148 


40 



150 
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SEQ n> 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


1 % 
Identit 

y 


817 


A 1 ^7 A AC\ 

Ar 2 17449 


Schistosoma 
mekongi 


NADH dehydrogenase subumt 6 


102 


37 


818 


AB012223 


Canis familiaris 


ORF2 


101 


54 




A / louz 


Nicotiana 
tabacum 


extensin 


1 /CO 

162 


45 


ozU 


A/1 AO 1 


Homo sapiens 


xiuman secreted protein, ID iMCJ: 
8172. 


1 C A 

1 J»4 


36 


OZ 1 


r> AC A 1 o 


Homo sapiens 


Neural thread protein. 


153 


61 


ozz 


GU035 / 


Homo sapiens 


Human secreted protein, SbQ ID NO: 
4438. 


125 


61 




U4997J 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


364 


57 


oz4 


U433oU 


Peromyscus 
maniculatus 


reverse transcriptase 


1 O 1 

121 


A O 

48 


825 


AF194537 


Homo sapiens 


NAG 13 


224 


58 


ozo 


T TA'7iC'7 A 

U976/4 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


437 


72 


82/ 


a t?ac 1 too 

Ar 05 1782 


Homo sapiens 


diaphanous 1 


108 


38 


ozo 


A 17 1 Ci A C^T 

AJb 194537 


Homo sapiens 


NAG 13 


92 


45 


ozy 


JJJo 1 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


492 


75 


830 


M64791 


Rattus 
norvegicus 


salivary proline-rich protein 


1 10 


46 


OJ>l 


"VCC/CCC 


Lycopersicon 
esculentum 


extensin (class I) 


108 


31 


o\3z 


i 12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


1 17 


34 


833 


U93564 


Homo sapiens 


putative pi 50 


84 


40 


834 


U93563 


Homo sapiens 


putative pi 50 


262 


54 


no C 

835 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


214 


80 


836 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


107 


48 


837 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/o70. 


158 


76 


838 


AF194537 


Homo sapiens 


NAG 13 


153 


60 


839 


XJ52011 


Homo sapiens 


mariner transposase 


344 


67 


840 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


104 


46 


841 


W90847 


Homo sapiens 


Human lymphocyte targeted peptide 
#15. 


130 


92 


o4z 


M64791 


Rattus 
norvegicus 


salivary proline-rich protein 


114 


36 


o43 


CjUJ 133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


196 


43 


OA A 

o44 


Xyz4o5 


Plasmodium 
vivax 


pval 


102 


73 


QA C 

o4j 


"V<£ 1 A/1 O 

AolU4o 


Hydra sp. 


mini-collagen 


106 


41 


o4o 


VkTA QO CI 

W4o351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


92 


48 


o4/ 


117/1 Q2C1 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


1 12 


54 


o4o 


PAO /PO 

urU343o 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


128 


68 


OH-y 


A TT1 QA C^T 
/\r 1 274 J3 / 


Homo sapiens 




i/ii 
141 


AH 

4o 


850 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


121 


45 


851 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


153 


58 


852 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


220 


60 



151 



WO 01/64835 



PCT/USO 1/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


OJJ 




xr loo 111 U Ul UI11 

vivax 




pva 1 


1 97 
iz. / 


A < 


OJ^r 


AF1 ^430S 




OldlJ 


00 

77 




855 


D381 12 




~NFAT"VF-T (~\f*\i\/r\r'r\CTf*ir\ ncp cuVinnTf 9 
X\/ \LJL 1 UCiiy Ul UgCiJdoC OUUIUIIL JL 




Oo 


856 


S80119 


Rattus sp. 


reverse transcriptase homolog 


159 


56 


857 


AF1 300R9 


xxUllHJ £>ajpiCllo 




1 1 9 

1 1Z. 




858 


AK024372 


Homo sapiens 


unnamed protein product 


129 


50 


OJ7 




jnomo sapiens , 


Human secreted protein, SEQ ID NO! 

7R71 




nc 
/O 


R6fl 


D^RI 1 9 


lTnmr\ campnp 


Lvjuiirume c oxiud.se bUDunit d 


97Q 


RO 


861 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

99 1 A 


96 


44 


862 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


83 


40 


863 


AF2 10651 


Homo sapiens 


NAG 18 


122 


63 


004 




Mus musculus 


endonuclease/reverse transcriptase 


1 f\C\ 

109 


51 


865 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


115 


34 


866 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

a A no 
44 fo. 


107 


80 


867 


X92485 


Plasmodium 
vivax 


pval 


105 


41 


868 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A 1 O 

4478. 


127 


62 


869 


X92485 


Plasmodium 
vivax 


pval 


105 


38 


O /U 




Homo sapiens 


Muman secreted protein, bh,Q 1JD JNO: 

A /ITS 
44/5. 


90 


56 


871 

0/1 


i Z /D / 1 


: 

Homo sapiens 


Human secreted protein encoded by 
gene ino. 0. 


lot? 


11 


R79 






nunidii se ere tea proiem, utiy llj invj. 
7214. 


1 9^ 


DO 


R73 

O / -J 


AF1 30080 

r\J7 J.JUU07 


JTlUillu sapicilb 




IOU 


89 


874 


AF 118082 


Homo sapiens 


PRO 1902 


143 


65 


87^ 




Homo sapiens 


putative pi 50 


1 OA 


A A 

44 


876 


M10546 


Homo sapiens 


cytochrome oxidase I 


248 


75 


Off 


Uo33U3 


Homo sapiens 


line-1 reverse transcriptase 


127 


52 


878 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/o71. 


106 


35 


o /y 


U 1 1Z&& 


Drosophila- 
melanogaster 


diaphanous protein 


93 


46 


880 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 
proiem rnaia. 


71 


47 


881 


B08942 


Homo sapiens 


Human secreted protein sequence 
encouea oy gene io oxii^ iu iNU.yy. 


95 


40 


RR9 


API ^OftRQ 


xioino sapiens 


PT? CYJ ^ a 


1 in 
13 / 


A A 

44 


RR' 1 * 

OO J 




nomo sapiens 


rivUuOJ / 


1 49 ! 


HI 
ID 


884 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/0O0. 


97 


60 


RR^ 


A01Z70 


£\.a.liUb 

norvegicus 


Anon ■i-oo/'lit-irr •fV'orv^ci 1 

open reading irame z 


1 DO 


A 1 
43 


RR6 


VI 07^7 
I iy /O / 


Homo sapiens 


oJbv^ 1JL/ JNU 4oD rroiTi Wvjyyzzz43. 


Q1 

¥ 1 


O / 


887 


X14963 


T-Fr>mr> Qanipnc 


r*r»11acif a r»— nrrvt^i n {A,A'~7 A A \ 


1 ^0 


SI 

0 1 


888 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


117 


55 


889 


L25616 


Homo sapiens 


CGI protein 


150 


62 


890 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


171 


66 


891 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 


107 


40 



152 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






— — 


iragment, ob^ id jni-J.jo/. 






QOO 




Bos taurus 


IniofAna TJO A r 7 / A A 1 10 7^ 

nistone rizA.z, (AA l-iz/) 


Q its: 


79 


oyo 


v uuooz 


Homo sapiens 


UKJr I (JNAJJri dehydrogenase suDunitj 


1 AO 


46 




l—J-J Ol 1Z. 


AiUIllU bcipiCllb 


locxironie c oxidase i>uDunii i 


1 ^73 


ID 


0:7 _> 




r lasmoaium 
vivax 


pval 




4 / 










I \J\J 




897 


D5Q926 


Homo sapiens 


The KIAA0136 gene product is novel. 


280 


89 


0-70 




Homo sapiens 


precursor polypeptide 


1 14 


yD 






Homo sapiens 


gamma-glutamylcysteine synthetase 


i m 


nn 
9U 


con 




Homo sapiens 


Neural thread protein. 


rvr 

yj 


75 


901 


L27428 


Homo sapiens 


reverse transcriptase 


81 


47 


00.0 


VI T71 1 


Mus rnusculus 


rro-Pol-du I rase polyprotein 


1 AA 

104 


53 




r\f\A 1 o 
OUU4iz 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


105 


76 


n A/t 


A IT A 1 A1 /I/I 


Homo sapiens 


neuronal thread protein AD7c-NTP 


126 


37 


9U3 


Ar 130089 


Homo sapiens 


PRO2^i>0 


88 


82 


yuo 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


230 


50 


907 


L27428 


Homo sapiens 


reverse transcriptase 


114 


64 




G00377 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4458. 


142 


71 




U93570 


Homo sapiens 


putative pi 50 


130 


41 


910 


AF010144 


Homo sapiens 


neuronal thread protein AD7c~NTP 


91 


63 


911 


L27428 


Homo sapiens 


reverse transcriptase 


168 


46 


912 


X92485 


Plasmodium 
vivax 


pval 


91 


60 


913 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


103 


64 


914 


G03789 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
7870. 


96 


64 


01 < 




Homo sapiens 


Human d bol related polypeptide o-bQ 
ID NO: 1051. 


121 


51 


Oil £. 

916 


O AO CA1 

G02501 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6582. 


87 


77 


917 


Miz(J99 


Mus rnusculus 


proline-rich protein 


129 


44 ; 


918 


M15530 


Homo sapiens 


B-cell growth factor 


88 


51 


919 


Afn AA1A 

Ar 130079 


Homo sapiens 


PR02852 


158 


88 


920 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


103 


77 


921 


X53581 


Rattus 
norvegicus 


ORF4 


124 


32 


922 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


110 


50 


923 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


99 


36 


924 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


80 


47 


925 


AC006127 


Homo sapiens 


BRG-1 -HUMAN ; nuclear protein 
GRB1; homeotic gene regulator; SNF2- 
BETA; MITOTIC GROWTH AND 
TRANSCRIPTION ACTIVATOR; 
POSSIBLE GLOBAL 
TRANSCRIPTION ACTIVATOR 
SNF2L4 


442 


92 


926 


L17318 


Rattus 
norvegicus 


proline-rich proteoglycan 


126 


39 


927 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


321 


76 



153 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


Q7S 




thaliana 


pu.la.il VC Oi31Nn"JD//TJVv>"l/lIlUiIig 

protein-like protein 


07 




Q7Q 
y^y 


Yf)Q R77 




put. precursor 


1 17 




930 


U93563 


Homo sapiens 


putative pi 50 


325 


70 




AF5H1 7777 


Pomp foi'nih'Sric 

Senile* icmiiiidrib 


\JX\JrJ. 


7fV7 




932 


AF053538 


Alvinella 
pompejana 


fibrillar collagen chain FApl alpha 


114 


37 






Homo sapiens 


riuman secretea protein, briv^ IN<J. 
7871. 


1U / 


d7 






Homo sapiens 


vJKr i , iviJiKJ / , putative transposase 
similar to pogo element 


Zoj 






T T/l 1 ft 1 ^7 


Caenorhabditis 
elegans 


repetitive region; weakly similar to E. 
gracilis major membrane skeletal 
protein \jr \.js.^j\f\D i \ i / j 


1 C\H 




936 


U47855 


Araneus 
diadematus 


fibroin-3 


109 


33 


937 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

11 1 A 


95 


43 






— — ; 

Homo sapiens 


Human breast cancer related protein 

D^lvDZ. 


1 AO 


jO 


Q7Q 


1VI 1 J 1 u 1 


ivaUUS 

norvegicus 


unknown protein 


1 O 1 


/in 


y*t\j 


\Jl/UOo7 


riomo sapiens 


jriuman secretea protein, oriv^ ru rNLJ. 

4770 
*+ / y o. 


ill 


/l^ 




AFfil 0144 
r\JTU IVJ 1*H* 


xiomo sapiens 


neuronal tnreau protein j\u i c-rs 


oo 


^7 


942 


L27428 


Homo sapiens 


reverse transcriptase 


86 


45 






Homo sapiens 


putative pi 50 




A C\ 


944 


Y64890 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
iju inl^. iuj i . 


110 


48 


QA^ 


T 7747 £ 
ZHAO 


— : 

Homo sapiens 


reverse transcriptase 




OO 


946 


AF1 94537 


Homo sapiens 


NAG 13 


146 


47 


yH- / 




Homo sapiens 


Human secreted protein, SEQ IT) NO: 
6619. 


O 1 


< ^ 


QAQ 




Arabidopsis 

inalldlla 


putative Uri JN r-D/AKo- Dinaing 
protem-iiKe protein 


111 


J / 


y^vy 


AF1Q4S77 


jriomo sapiens 


IN/WJ 1 3 


i 

1 uo 


OO 


950 


Y '14437 


Homo sapiens 


Human secreted protein encoded by 
gene i* i t-ione no/\ w /-\z / . 


74 


75 


951 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


78 


66 


952 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


85 


72 


yDj 


T T44Q7R 


Glycine max 


extenstn 


1 A < 




Q^A 


i\l jJJD 


riomo sapiens 


jrrotem encouea aownstream oi nnc ivl 
oncoprotein. 


1^1 


70 


y~> ~j 


UUJoU 1 


uoniu sapiens 


rrurnd.il secreteu protein, ojcv^ iju inu. 
7SS7 


I/O 




956 


(^00^54 


stujilhj atxpicii^ 


4435. 


1 UJ 


1A 

1 H- 


957 




nomo sapiens 


rlvUZjJu 


170, 


77 




flfnn47 


riomo sapiens 


numan secretea protem, oriv^ iu ini_/. 
7124. 


11^ 


71 


yjy 


APnonoj.9 

r\F yjyyjy^rZ. 


riomo sapiens 


Jrrv^JUOD / 


O J 




960 


T 27428 


Hnmn QanipnQ 


l&VClijC LA ulloul iULaow 


121 


30 


961 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


157 


68 


962 


AF090942 


Homo sapiens 


PRO0657 


138 


61 


963 


U83280 


Leishmania 
donovani 


39 kDa antigen 


101 


53 


964 


U93566 


Homo sapiens 


p40 


102 


31 



154 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


965 


G03786 


Homo ^aniens 


Human ^rrpfpH nrn tfMn SFO TD WO* 
7867. 


104 




966 


G00689 


Homo sapiens 


Human secreted nrotein SEO ID NO" 
4770. 


136 


40 


967 


AF090930 


Homo sapiens 


PRO0478 


158 


80 


968 


ABO 12223 


Canis familiaris 


ORF2 


94 


36 


969 


AF010144 


Homo ^anipn^ 


neuronal threnH nrofpin AF)7r-NTP 

JLlwULl VJJL1Q.1 LULL L'uU IJJL LJLt/liJt i\ / V_> X N AX 


127 


— '^t 


970 


X53581 


Rattus 


ORF4 


163 


43 


971 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7534. 


109 


56 


972 


G03 1 14 




T-Tnmsiri Q<=>pt*£»fwi TM*otp»iTi QPH TO MO* 

JTlULllu.lL bCLlCltU j^lULCLUj ODV^ XXJ 1NV-/. 

7195. 




47 


973 


Y69166 




a maim c uuiiiaii iN-a^Giy igiy^ubdiiiiiij'i 

transferase protein. 


OS 


oU 


974 




WnmA Q?mtf»nc 


Rnman c^rre^t^d nrotein QPH TO TsTO* 

nilLlJUUl P^lOlClllj «3J3V^ xxJ YWJ . 

7176. 


74 




975 


U93574 


T-Tr\mr\ cani c 
ixk/iiiu ocijJiCiia 


]-J U Lett J VC jj I J \J 


1 AO 


4^ 


976 


VV tOJJ 1 


T-Tr»TTir» cam" pii c 

llUlllU DCipiC/llO 


llU.Liid.Ll UlC/uot OdllUCI ICldlCLl JJLULClll 


70 


OD 


977 


Y08062 


T-Tnm a ^arii^rx: 


Human T^T?094S nrotpin fr?»crm^Tit 

derived from DNA35638. 


LIU 




978 


G03411 




Human «ipr-reterl nrotein RPO TO NO- 

JL JLLLllltTLl OCL/ICLUU Wtt'ILl, Ot-iV^ xxJ IN V^. 

7492. 


i to 




979 


AF1301 14 


Homo ^fjnifn*? 


PR02459 






980 


K03202 


Homo <sar>ipiiQ 

llwlllVJ ocljJLt^lio 


Qnlivarv "nroline-ripli nrotpin rvr(=nii*cr^r 
oa.iiva.i_y \j\ \j mi t/*"i its ii iji ulc iij L/icLiiHotJi 


QO 


40 


981 


AF1 16909 


T-Trvmfi csa'nifttiQ 

JL JLVlllVJ DlXJJICJLI ;> 


nnVn own 

UllJVllLr W 11 


1 1 s 

1 iJ 


49 


982 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


106 • 


75 


983 


Y 14674 


Plasmodium 
falciparum 

At4.LV/ jvUSCLI. U-1AJ. 


glutamate—cysteine ligase 


106 


66 


984 


AF229067 


Homo sapiens 


PADI-H protein 


152 


60 


985 


AFI 1990(1 


MAinn canipn c 
x XyJiiLSj oaiJicilD 






48 






T-T i"\Tin r\ c a t> i pt^ c 
JLjlCUIILJ oatL/iCllo 


llLtU tall iCCrctCU piOLClii, OiJ.v^ xi^ LS\J. 

7795. 






987 


R13556 


Homo <iariiftTi«i 


TProt^iTi PTiP.oHed Hfiwn^trp^m o^hHr* A/I 

XlVJl.t/111 C^llUULLt/Ll WlloLl v/Cllll Ul 1JU1V_- 1V1 

oncoprotein. 


1 06 


fiO 


988 


G03 1 1 9 


Homo ^aniens 


Human ^perereH nrotein SP'O TO NO" 

1 1 HILL Ctll j&vl vLt«U JJl \J Lv> 111 9 kJ J_J/V»{ 1 J-^ 1> V^ 7 . 

7200. 


O l 


^/ 1 


990 


U35730 


Mus musculus 




133 1 


29 


991 


AFI 13685 


Homo sapiens 


PRO0974 


136 


63 


992 


U52077 


Homo saniens 

xx\jxxl\j ijuuivxia 


mariner tranino^p^e 


497 


77 


993 


Z97211 


Schizosaccharom 
vces nomhe 


kinesin-like protein 


197 


47 


994 


AF010144 


Homo sapiens 


neuronal thread nrotein AD7c-NTP 


124 


68 


995 


U93 563 


Homo sapiens 


putative pi 50 


157 


50 


996 


ABO 15802 


A cetohaeter 
xylinus 


cimilar to pplliilo^e pomnlempntinor 
nrotein of A xvlinum ATOC23R69 


140 


S4 


997 


G02363 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6444. 


144 


60 


998 


D38116 


Pan paniscus 


cytochrome c oxidase subunit 1 


352 


75 


999 


B03148 


1-TorriO Qfirvipnc 


TTi i m n n <^n t^riTi n 1 H i f F r'l^f^trt'i d+ir^n -fViz-^-f-rM- 
11111110.11 iicuiuiiai unici CLiiiaiiuii iaC/Lt;r~ 

1 (NDF-1). 


S94 


7R 


1000 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


377 


70 


1001 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


109 


50 


1002 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


249 


61 


1003 


M14702 


Murine leukemia 


pol poiyprotein 


206 


48 



155 



WO 01/64835 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






virus 








1 0f)4 


£jZ JLJ U / 


noniu Sapiens 


iiuiiiaii elongation iduior-i-ueiia 


S 1 1 

D l 1 




1005 


L27428 


Homo sapiens 


reverse transcriptase 


176 


63 








inm. LJri aenyurogenase suduihl j 




Tl 
/ 1 


7 aat" 




riomo sapiens 


PT?r^Al T H 
y:\SXJ\J 11/ 


1 /C9 
1 OZ 


OD 


lUUo 


rznri i no 

UUj 1 UZ 


rioiiio sapiens 


xiuman secretea protein, ocy iu inv_a 
7183. 


1 A A 
1 


OD 


1UU7 


T Ld4R^R 


vJiyLirie niax 


exLcnsni 


1 00 








.r lasmooium 

laioipcll iXlxl 


glutamic acid-rich protein 


1 1 A 
1 14 




1011 


L27428 


Homo sapiens 


reverse transcriptase 


114 


52 


1U1Z 


Ar 1 jyJyJtSy 


Homo sapiens 


rxsXJZD jU 


1 1 A 

1 14 


/ / 


1013 


G04092 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

0 1 T2 
51 1 5. 


81 


44 


1014 


AF090895 


Homo sapiens 


PRO0117 


97 


65 






Homo sapiens 


T>T> r\0 CCA 


168 


83 


lUlo 


AF079:>o7 


Mesocricetus 
auratus 


cytochrome c oxidase subunit III 


276 


52 


101 / 


A T?AC C AOC 


Onchocerca 
volvulus 


pyrrolidone-rich antigen 


94 


44 


1018 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3 . 


114 


46 


1019 


AF090944 


Homo sapiens 


PKO0663 


137 


50 


lUzU 


Yzl 166 


Homo sapiens 


Human bc!2 proto-oncogene mutant 
protein fragment 14. 


84 


36 


1021 


A T -1 A 1 A1 /l /I 

AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


140 


37 


1022 


AT A/1A£!AO 


Arabidopsis 
thaliana 


extensin-like protein 


105 


34 


1 AT 1 
lUZo 


YUZ 74y 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


152 


52 


1 AO /I 

1UZ4 


(jrl)o / 14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


125 


66 


1025 


M12140 


Homo sapiens 


envelope protein 


143 


62 


1026 


T T/1 AAO'7 

U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


200 


58 


1UZ / 


T T/1QOT2 


Homo sapiens 


UKr 1 , JvlbKJ / , putative transposase 
similar to pogo element 


ZOO 


67 


lUZo 


ArU41^iU 


Bodo saltans 


NADH dehydrogenase subunit 5 


97 


31 


1029 


G02950 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7031. 


102 


56 


1030 


G02482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6563. 


76 


63 


1031 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

ah i o 
4718. 


74 


56 


1 AIO 

1U3Z 




Homo sapiens 


Protein sequence and annotation 
available soon via Swiss-Pro t; available 
at present via e-mail from 
LABEIT@EMBL~Heidelberg.DE 


389 


1 A A 

100 


1 AH 


OUZOJH- 


xi-omo sapiens 


riuman secreieaL proiem, ocy iu inl/. 

O / J 3. 


yo 


/t A 
4U 


1034 


AF01 A1 Ad 


numo sapiens 


neuruiiai uireaci pruLcm jt\u /o-in i r 


1 1 £ 
-5 io 




1 ai^ 


API Q/1^17 


Homo sapiens 


>j a n 1 1 


ZUo 


jZ 


1036 


T 27428 


TTaitia cQT^if^rt c 


icvoisc Li cuiowl luia&v 


1 oo 


si 

J 1 


1037 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


107 


44 


1038 


AF1 16638 


Homo sapiens 


PRO 1546 


56 


61 


1039 


U93570 


Homo sapiens 


putative pi 50 


138 


40 


1040 


AF130089 


Homo sapiens 


PRO2550 


150 


91 


1041 


AK024455 


Homo sapiens 


FLJ00047 protein 


151 


68 



156 



WO 01/64835 PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






Homo sapiens 


Human secreted protein #28. 


Q7 

y f 


A 1 

41 


1 ha^ 




jnomo sapiens 


putative pi_>u 


1Z4 


34 


1 AAA 


T TAQQ'7'2 


Homo sapiens 


UKr i , ivLbJto / ? putative transposase 
biuiuai lo pogo cierricriL 


zoo 


71 


1045 


U93563 


Homo sapiens 


putative pi 50 


246 


54 


lU^O 


VjrUU^f 1 0 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


1 3 I 


04 


1 C\A1 




Homo sapiens 


putative pi 50 


1 07 




1048 


AF130114 


Homo sapiens 


PR02459 


117 


67 




TT10Q1Q 


Mus museums 


adenylyl cyclase type VII 


1 h c\ 
170 


68 




ACUUoUo4 


Leishmania 
major 


t QAC1 1 

L8453.1 


129 


30 


i rw i 


yvyy4o / 


Medicago 
truncatula 


JbJNUJJZU 


1 1 r\ 

1 10 


0 0 

35 


1052 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


150 


76 




Ar 1 10/12 


Homo sapiens 


PRQ2 /\> 8 


109 


44 


1U~>4 


M96256 


Homo sapiens 


rapamycin binding protein 


168 


56 




T T1 *Z£Z AH 


Mus mus cuius 


reverse transcriptase 


86 


37 


1056 


AL024498 


Homo sapiens 


dJ417M14.2 (novel serin e/threonine- 
protein kinase (ortholog of mouse and 
rat MAK (male germ cell-associated 
kinase)) 


190 


72 


1057 


AF090942 


Homo sapiens 


PRO0657 


103 


63 




A Crt 1 At /t/i 


Homo sapiens 


neuronal thread protein AD7c-NTP 


156 


68 


1059 


AF081114 


Mus musculus 
domesticus 


ORF2 


134 


47 


i r\ezr\ 


Y 02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone Jri J DAD22. 


180 


83 


1061 


U70935 


Peromyscus 
maniculatus 


reverse transcriptase 


126 


45 


1062 


G03790 


Homo sapiens 
— 


Human secreted protein, SEQ ID NO: 
/o/l. 


84 


55 


IUOj 


T T1 <£/f7 
U 1 J>0'+ / 


Mus musculus 


reverse transcriptase 


95 


38 


1U04 


U 7JJO / 


Homo sapiens 


putative pi 50 


128 


CO 

58 


xKJOD 


Y 144 /y 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 




66 


1UOO 




Plasmodium 
vivax 


pval 


1 in 

1 19 


62 


1067 


U93567 


Homo sapiens 


p40 


161 


48 


lUOo 


1 1 1 

UjSI iz 


Homo sapiens 


cytochrome c oxidase subunit 3 


j>4U 


84 


1069 


U93570 


Homo sapiens 


putative pi 50 


107 


59 


1 C\1f\ 


A T?"70 1 fiC 1 

AJr5zlUjl 


Chalinolobus 
tuberculatus 


cytochrome c oxidase subunit III 


j>33 


71 


1071 


U93567 


Homo sapiens 


p40 


99 


28 




OU4Uy 1 i 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 




00 


IU / J 


YooZ4o 


Homo sapiens 


Human secreted, protein HCriProS, 
SEQ ID NO: 163. 


136 


59 


1 CY1 A 




Homo sapiens 


putative pi 50 


140 


53 


1075 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


66 


43 


lU/O 




Arabidopsis 
thaliana 


extensin-like protein 


IVJD 


J / 


1077 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


94 


66 


1078 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


81 


57 


1079 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


83 


35 



157 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 








0 1 TO 

01 /Z. 






1 AQA 


Ar J OZ 1 4y 


Mycoplasma 


variable surface lipoprotein 


1 A"» 


41 






bovis 








iUo 1 


vjrU J 1 f Z 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 A Q 
145 


75 


















jreromyscus 


reverse transcriptase 


1 O 7 
1Z 1 


/fO 

4Z 






in dnLcuidiiis 








lvOj 


T TQ^S^d 

l-J 7JJUt 


jtiuiiiu oapiciis 




07 


/lO 

4Z 


1 DRA 




Homo sapiens 


JrAJLii-ri protein 


1 A ^ 

14_> 


ol 


IUoZ) 




Homo sapiens 


UKr i , M-bKj / 7 putative transposase 


ZZl 


A 

60 








similar to pogo element 






lUao 


Uooj /J 


TT 

Homo sapiens 




165 


67 


1 A 527 


Y JOIDO 


Homo sapiens 


Human secreted protein #28. 


93 


72 


1 fiQQ 

lUoo 


A T71 QA 


Homo sapiens 




142 


62 


IUoV 




Homo sapiens 


Human secreted protein sequence 


9j 


62 








encoded by gene Zo bev^ l 1 -* JNU: 133. 






luyu 


A T? 7 A /I C7-7 

194537 


Homo sapiens 


"XT A /-» 1 *5 

NA013 


155 


40 


1 aci i 


A/t a a 1 

G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


103 


35 








8172. 






i aao 


A T? A 1 A 1 /I /l 

Ar010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


195 


40 


1093 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


91 


46 








similar to pogo element 








X53581 


Rattus 


ORF4 


106 


62 






norvegicus 








1095 


AT 1 SI f\ A r\~> 

AL 160493 


Leishmania 


probable (hhv-6)ull02 5 variant a 


129 


51 






major 


DNA, complete virion genome 






1096 


Y86473 


Homo sapiens 


Human gene 52-encoded protein 


72 . 


33 








rragment, bbK^ ID NU.aoo. 






1097 


U4U34Z 


Mus museums 


ninein 


152 


44 


1 MOO 


MZ4/3Z 


Homo sapiens 


lamin-like protein 


92 


37 


1099 


X92485 


Plasmodium 


pval 


111 


67 






vivax 








1100 


Y27868 


Homo sapiens 


Human secreted protein encoded by 


156 


86 








gene No, 107. 






1 1 A1 

1101 


A TJAAA/T 1 £T 

ArOOOolo 


Oryza sativa 


similar to RING-H2 finger protein 


1 16 


90 








TJU A 1 n ( A T? f\n o tz o \ 

KrlAla (ArU /do&3) 






1 1UZ 


LfUU4J / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


158 


71 








4 J 1 






1 1 A'S 

1 lUJ 


LrUJ 1 J J 


—— ; ; 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


36 








TO 1 A 

J A 14. 






1 1U4 


uyjD /Z 


— — - ; 

Homo sapiens 


putative p 1 50 


168 


56 


1 llO 


r TA"2 <^7f\ 


Homo sapiens 


putative pi 50 


96 


40 


1 7 a< 

1 iUO 


LjZ /4Zo 


Homo sapiens 


reverse transcriptase 


188 


43 


1 1 CYl 


AJJJO I 


Kattus 


ADT/1 

UJvr 4 


1 A 1 

341 


43 






norvegicus 








1 1 AQ 
1 lUo 


VOI ^77 


Homo sapiens 


Human secreted protein sequence 


344 


77 








encoaeo oy gene z ojiq iu jn vj:z«>u. 








nno/toA 
ouz^fyu 


—— : 

Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 0/< 


OO 








£^71 
O J / 1 . 






1 1 1 A 


U7JJ07 


— — : 

Homo sapiens 


putative pi 50 


1 c*c 
1 JO 


38 


Till 
1 1 j. i 




Homo sapiens 




13j 


j4 


1 1 1 o 
1 1 1Z 


x UZO / I 


Homo sapiens 


Human secreted protein encoded by 


71 


63 














1113 


AF0 16099 


Mus musculus 


endonuclease/reverse transcriptase 


124 


62 


1114 


L27428 


Homo sapiens 


reverse transcriptase 


200 


40 


1115 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


101 


50 








7077. 






1116 


L27428 


Homo sapiens 


reverse transcriptase 


122 


70 


1117 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


65 



158 



WO 01/64835 PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








4718. 






1 1 lo 


W40JJ 1 


Homo sapiens 


Human breast cancer related protein 

J3^rut>z» 


i 1 
1 lo 


65 


1119 


D38484 


Hylobates 
synuaciyius 


Cytochrome C oxidase subunit 1 


315 


89 


1120 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


115 


76 


i iz 1 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 Jo 


68 


1 177 




noaio sapiens 


ubicjuitin C-terminal hydrolase 


103 




1123 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7££7 
/oO/. 


94 


55 


1124 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

&*\71 
OD / 1 . 


154 


65 


i i7<; 


AThA1 A1 AA 


— — : 

Homo sapiens 


neuronal thread protein AD7c-NTP 


1 A£ 

iUo 


JO 


1 17^ 


aft ^aabq 


riomo sapiens 


pt? n7 ^ A 


JO 


IZ 


1 1 


\J\)jLOjZ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

Oy 1 J. 




jy 


1128 


Y14479 


Homo sapiens 


Fragment of human secreted protein 
encoucci oy gene i^. 


80 


59 


1 17Q 


API 1 


. 

Homo sapiens 


PPA1 9.A7 
risXJ 1 o4 / 


140 


7A 


1 1 

I IjU 


A 171 OAS ^7 


riomo sapiens 


AT A 0, 1 7 


1 C7 
1 oZ 


00 


1131 


L27428 


Homo sapiens 


reverse transcriptase 


173 


38 


1 1 


TTQ^^TA 

U^70 J /U 


Homo sapiens 


putative pi 50 


1 1 A 

i iy 


1 A 


1 1 


r^AA/1 A'? 


Homo sapiens 


Human secreted protein, bh^i ID JNU: 


1 C7 

15 / 


68 


1134 


AJ004810 


Zea mays 


cytochrome P450 monooxygenase 


79 


87 




A TP A 1 A 1 A A 

ArulU144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


1 JO 


70 


1 1 Q/^ 




Homo sapiens 


Jbiuman secreted protein, biiQ 1L> INU: 

JJOJ. 


O 1 

51 


TO 

Id 


1137 


ABO 18705 


Mus museums 


ORF2 


138 


36 






Homo sapiens 


protein serine/threonine kinase 


1 ^A 

130 




i loy 


W4oj D 1 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 




/CA 
OU 


1 izin 


API CM 7 


Homo sapiens 


ATA CI \ 3 


1 i r 


JJ 


1141 


U93564 


Homo sapiens 


putative pi 50 


135 


51 


1 1 /io 
1 142 


JL>ODo3-> 


Catharan th us 
roseus 


extensin 


142 


^7 


1 14J 


JJUUD /U 


Mus musculus 


open reading frame (25 1 AA) 


nil 


c n 
jU 






Homo sapiens 


Jtluman secreted protein, J>h.Q iu NO: 

71 # A 


62 


38 


I 14j 


/U0o4 


— . 

Caenorhabditis 

filler ci ti o 


T77QF41 Q 
rioi'1.8 


1 AC 

IUj 


on 

jZ 




no^ i ^ 

VJUJ XJJ 


rioiiiu bdpiens 


xiuiiioii aet/ietcQ protein, jjiy lu vskj. 
7214. 


70 


S4 
34 


1147 


Y36156 


Hnmn can i pn c 


t3Pf*rpl"pH wntpin 


1 SI 


67 


1148 


A23786 


Beta vulgaris 


chitinase 1 


98 


37 


1 140 

i i.'-ty 


AF17Q7S6 


n.\JIIlU 5d.pi.cnb 


RAT7 


1 77 
1 / / 


DZ 


1150 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


147 


73 


11^1 




Homo sapiens 


reverse transcriptase 




^ i 
j 1 


1152 


U34044 


Homo sapiens 


selenium donor protein 


238 


48 


1 1 <i 


A TSflOAAtZtZ 

A15vl>Z44D3 


Homo sapiens 


"CT TAAAvt'7 „ „^4-^^ ^, 

Jb JLJ UU04 / protein 


78 


78 


1154 




nuiiiu bapjCIlb 


1 lT? 1-7 1 " l\ /T 1-4 1? 7 / • "rvn"i~Cl'1'i\/o frdnonrvfico 
V^rSJT 1, IViXIyJvJ /, pULaUVC U allbpOSabC 

similar to pogo element 


1 Q7 
iy I 


O ( 


1155 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


120 


64 


1156 


U25281 


Rattus 
norvegicus 


SH3 domain binding protein 


98 


34 


1157 


AF010144 


Homo sapiens 


neuronal thread protein AD7c~NTP 


138 


54 



159 



WO 01/64835 PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1158 


AF 194537 


Homo sapiens 


NAG 13 


106 


48 


lljy 


T 1A 007^1 


Homo sapiens 


ORF1; MER37* putative transposase 
similar to pogo element 


J) J / 


69 


1 1 

1 lOU 


U7JJ I A, 


; 

Homo sapiens 




&Q 

07 


a n 

4 / 


1161 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

79 


184 


68 


1 1 \>sL 


\J\JD / o / 


rioiiio sapiens 


iiu.iiid.ri secreien proiem, oxiv^j ilj inu. 
7868. 


1 93 


J I 


1 1 a ^ 


A XT 1 1 £T] 1 O 

/\r 1 I o / i^z 


Homo sapiens 


pp m7^s 


1 9Q 


oy 


1 I o<+ 


v uuooz. 


Homo sapiens 


cytochrome oxidase I 


400 




1165 


G00416 


Homo sapiens 
■— — 


Human secreted protein, SEQ ID NO: 

AA Q9 


118 


57 


1 1 DO 




Mus musculus 


histone Hie 


i n/i 
1U4 


in 

3 J 


1167 


X70343 


Nicotiana 
syivesiris 


extonsiii 


95 


33 


1168 


AF130051 


Homo sapiens 


PRO0898 


117 


43 


1 1 /CO 

1 loy 


t i /I 
OU3 / 14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

T7Q< 
/ lyD. 


104 


48 ! 


1170 


L27428 


Homo sapiens 


reverse transcriptase 


149 


33 


i m 
1 J / 1 


(jrUD4y / 


Homo sapiens 


Human secreted protein, bhQ ID NO: 
4578. 


108 


62 


1 1 T~% 

LI 1 Z 


Ar 03 0277 


1 ragelapnus 
spekii 


cytochrome oxidase subunit III 


266 


54 


1 1 HI 

IS. 1 5 


L,220JU 


Glycine max 


hydroxyproline-rich glycoprotein 


87 


38 


1 1 T/t 

11/4 


U4yy /J 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


168 


45 


i lor 
1 1 ID 


U4Jo2/ 


Arabidopsis 
thaliana 


extensin 


1 1 1 


42 


1 1 76 


U43627 


Arabidopsis 
thaliana 


extensin 

. — 


98 


29 


1 1 HH 
1 1 J 1 




Homo sapiens 


putative pi 50 


OA 


CO 

DO 


11 / o 


Tfll (Vt9 
Jul U4 / 


Caenorhabditis 
elegans 


collagen 


1U& 


o A 


1 1 TO 

i i / y 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
00 1 j< 


1 /u 


Q 
DO 


1180 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


113 


36 


lid 

llol 


A rA 1 A1 A A 

/\r U 1 KJ 1 H4 


Homo sapiens 


neuronal thread protein AD7c-NTP 


1 J>4 


OJ 


1182 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


107 


62 


1 1 

1 loo 


i oOZ4o 


Homo sapiens 


xluman secreted protein HCJtirJpoo., 

Ct?A TT~> K!W 1 

oiH^/ LL) IN <J . 1 Oj . 


124 


45 


1 1 9.A 
1 Io4 


JLo oi IZ 


riomo sapiens 


iN/vuxi Genyurogenase suDunit z 


41 a 


oo 


1185 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


106 


41 


1186 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
lragmeni, oxi^ ijlj lNL/.'^yu- 


381 


75 


1 1 o / 


JL/^O 1 IZ. 


nomo sapiens 


cyiocnrome c oxiaase suounii l 


A ~2 /t 
434 


Jy 


1 loo 




Jtiomo sd.pi ens 


iine-i reverse aa.nscripLase 


9^ 




1189 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

929n 


159 


68 


1190 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
DD / 1. 


146 


68 


1191 


AF 118086 


Homo sapiens 


PR01992 


146 


81 


1 1 09 




xiorno sapiens 


Truncated pro-alpha 1 (III) chain. 


l(Jo 


OD 


1193 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


67 


34 


1194 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7637. 


116 


74 


1195 


Y3068I 


Homo sapiens 


Splice variant ZAP- IB protein of the 
human tumor suppressor gene ZAP-1. 


132 


82 



160 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1196 


G031 14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

o 1 

7195. 


75 


48 


1 1 no 




Homo sapiens 


UKf 1, MEK37; putative transposase 
similar to pogo element 


O C/1 

234 


73 




T \Ci1 COA 


Homo sapiens 


p4U 




->0 


1199 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


145 


39 


1200 


AK024455 


Homo sapiens 


FLJ00047 protein 


115 


56 


inn 1 

1201 


A CAAAA/IO 

AF 090942 


Homo sapiens 


PKO0657 


88 


64 


1 o AO 


AA /I /I O 

u 00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A ■CO'? 


124 


70 


IZUJ 


Y JUo22 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


1 13 


46 


1204 


PA1 AO/1 

U019o4 


Homo sapiens 


Human secreted protein, SbQ ID NO: 

OUOD. 


81 


41 


lOAC 

12UD 


F T1 COQ A 
U JZ> / 


— 

Mus musculus 


jerky 


1 AO 


oo 
2/ 


1206 


U 15647 


Mus musculus 


reverse transcriptase 


191 


45 


1 O AO 

1207 


T T 1 C/l A 1 "! 

U 15647 


Mus musculus 


reverse transcriptase 


1 O A 

124 


50 


1 O AO 


G0341 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


140 


58 


1 O A A 


A T? A 1 A 1 A A 


Horn o- sapiens 


neuronal thread protein AD7c-NTP 


A 1 

91 


54 


1210 


AF1 19900 


Homo sapiens 


PR02822 


160 


81 


121 1 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


122 


68 


1212 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


45 


1213 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


125 


61 


1214 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


102 


61 


1215 


G0356O 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7641. 


101 


46 


1216 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyprohne-rich glycoprotein DZ- 
HRGP 


270 


58 


1217 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ IDNO:387. 


105 


46 


1218 


AC00248j 


Homo sapiens 


putative product from mRNA sequence 
CG003 from BRCA2 region; match to 
U50534 (NID:gl685103) 


378 


97 


1219 


AF090895 


Homo sapiens 


PRO0117 


130 


58 


1220 


Ar 1 13685 


Homo sapiens 


PRO0974 


117 


60 


1221 


X61295 


Rattus 
norvegicus 


LI retroposon, a portion of its ORF2 
sequence 


126 


50 


12.12 


pno OO A 


Homo sapiens 


TTi.^.nn n —_. u ,^J_ n -J OPA TT \ XT/A, 

Human secreted protein, SEQ ID IN O: 
7870. 


1 A O 

148 


OA 

70 


1223 


W4o3Dl 


Homo sapiens 


Human breast cancer related protein 


AA 

99 


56 


'\ r y~)A 


T TQ7 CO/1 


— — z 

Homo sapiens 


putative pl50 


y3 


44 


IZZJ 


ATT7 QfkACI 


Homo sapiens 




1 "2 "2 
1 


/CO 




u y J D Oj 


xiomo sapiens 


putative pi 50 




4 / 


190'7 
1ZZ / 




noiuo sapiens 


L^ivr i , iviii£vj> / y puiaiive n^ansposase 
sunilar to pogo element 




4/ 


1228 


U93564 


Homo sapiens 


putative pi 50 


116 


47 


179Q 


W^l / JJ 


inuHio sapiens 


j.nijt-1 encooeci Dy tione Dy. 






1230 


U15647 


Mus musculus 


reverse transcriptase 


105 


42 


1231 


U93563 


Homo sapiens 


putative pi 50 


299 


54 


1232 


R95913 


Homo sapiens 


Neural thread protein. 


138 


51 


1233 


AF130079 


Homo sapiens 


PR02852 


203 


70 


1234 


X53581 


Rattus 
norvegicus 


ORF3 


106 


60 



161 



WO 01/64835 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


12^5 


A "ni 1 O AO/C 

AF 1 180&6 


Homo sapiens 


Tvra r\ i a ao 

PRO 1992 


1 A A 

144 


81 


1236 


vao/J o c 
Xy24&3 


Plasmodium 
vivax 


pval 


IOC 


71 


i on 


A "PA 1 A 1 A A 


Homo sapiens 


neuronal thread protein AD7c-NTP 


I lo 


75 


1238 


U93572 


Homo sapiens 


putative pi 50 


133 


40 


12^ 


a i o a a 


Homo sapiens 


Human secreted protein, c>bQ Id NO: 
5330. 


/C A 

69 


56 


1240 


AF130089 


Homo sapiens 


PRO2550 


136 


41 


1241 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


109 


53 


1242 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


161 


75 


1243 


G02752 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6833. 


87 


45 


1244 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


128 


58 


1245 


U93570 


Homo sapiens 


putative pi 50 


161 


50 


1246 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


121 


45 


1247 


AF257305 


Homo sapiens 


ASH1 


576 


89 


1248 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


98 


68 


1249 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


107 


43 


1250 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


130 


91 


1251 


U63542 


Homo sapiens 


FAP protein 


1 16 


61 


12jz 


OA/1 AA 1 

G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


109 


53 


1253 


AF068294 


Homo sapiens 


HDCMB45P 


251 


63 


IOC/I 

1254 


AF090895 


Homo sapiens 


T*T> /^s A1 n 

PRO0 1 1 7 


1 1 1 


60 


1255 


G02827 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/T A A O 

6908. 


201 


75 


1256 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


116 


51 


1257 


AB033032 


Homo sapiens 


KIAA1206 protein 


115 


80 


1258 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


122 


60 


1259 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


137 


68 


1260 


AB032906 


Hylobates 
pileatus 


dopamine receptor D4 


96 


35 


1261 


AF022985 


Caenorhabditis 
elegans 


Similar to collagen; coded for by C. 
elegans cDNA yk55f3.3; coded for by 
C. elegans cDNA yk66d5>3; coded for 
by Q. elegans cDNA yk71e4.3; coded 
for by C. elegans cDNA yk55f3.5; 
coded for by C. elegans cDNA 
yk66a5.5; coded tor by C eiegans 
cjjjna yjK/ie4.j 


106 


38 


LZOZ 




— : 

Homo sapiens 


p4u 


1 oo 
lo2 


OA 

Jy 


1263 


L20096 


Manduca sexta 


ribosomal protein s7 


227 


59 


1264 


A T? 1 1 AAA 1 

AF1 19901 


Homo sapiens 


PR02831 


103 


71 




Do Jh-HO 


Homo sapiens 


Similar to a C. elegans protein encoded 
in cosmid C27F2 (U40419) 




y / 


1266 


D90279 


Homo sapiens 


collagen alpha 1(V) chain precursor 


120 


42 


1267 


L27428 


Homo sapiens 


reverse transcriptase 


111 


41 


1268 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


126 


68 


1269 


AF0 16099 


Mus musculus 


endonuclease/reverse transcriptase 


102 


34 



162 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1270 


AF130089 


Homo sapiens 


PRO2550 


96 


69 


1 0*71 
LZ/ 1 


U IZ IK) / 


Homo sapiens 


Wiskott-Aldrich syndrome protein 


lzl 


45 


1272 


AF165310 


Homo sapiens 


ATP cassette binding transporter 1 


243 


100 


!-<£ ID 




Homo sapiens 


Neural thread protein. 


1 1 A 

I 1U 


70 


lZ /4 




Plasmodium 
vivax 


pval 


1 A/C 


65 


1275 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

IZJ3. 


104 


56 


1276 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

1AA < 


53 


36 


1277 


X03725 


Mus musculus 


ORF 2 (466 aa) 


103 


41 


1 T70 


T TAO C^7A 

U93570 


Homo sapiens 


putative pi 50 


98 


43 


1 mn 

1279 


O Art /I AT 

G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
44oo. 


159 


83 


1280 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAIO 

443 0. 


167 


71 


Izol 


AJz / 1 0 / 1 


Nicotiana 
sylvestris 


putative extensin 


1 AT 

105 


36 




tSJJjZKjD 


Homo sapiens 


salivary proline-rich protein precursor 


1 19 


32 


1283 


AF010144 


Homo sapiens 


neuronal thread protein AD7c~NTP 


124 


66 


1284 


G031 14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

nine 

7195. 


135 


50 


1285 


A3 1039 


Nicotiana alata 


PRP3 


112 


36 


1286 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


116 


72 


1287 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


289 


67 




com 1 0 
00UJ iy 


T) 4-4-. , ,-, t _ 

Kattus sp. 


reverse transcriptase homolog 


1 12 


33 


1 OQQ 




Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


203 


76 


1 OOC\ 

lZy\) 


ATTA1 A 1 


— — : : 

Homo sapiens 


neuronal thread protein AD7c-NTP 


1 j 1 


JJ 


1291 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/ZD J . 


128 


82 


1292 


AF130089 


Homo sapiens 


PRO2550 


127 


62 




ArUUJ jJj 


Homo sapiens 


UKrz-like protein 


1 A 1 

101 


48 




VI Q/C 1 A 


Homo sapiens 


MiQ iu NO dZo trom W09922243. 


1 AA 

100 


42 


1295 


L27428 


Homo sapiens 


reverse transcriptase 


126 


36 


129o 


GU0437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


140 


71 


i om 
LZy / 


T ^ A A 

L24433 


Oncorhynchus 
mykiss 


complement component C3 


359 


31 


1298 


AC004381 


Homo sapiens 


SA gene 


443 


55 


izyy 


GU124o 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


180 


64 


i inn 


i 144 /y 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 14. 


135 


66 




Uo4j:7 1 


— — 

Mus musculus 


reverse transcriptase 


1 A/C 

lUo 


A O 


1 "2Pr"> 


Ui-jyj 1 


Nicotiana 
tabacum 


extensin precursor 


11/1 

134 


42 




1 14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


idZ 




IjUH 


V1T71 Q 


Mus musculus 


Pro-Pol- dUTPase polyprotein 


93 


3o 






jnomo sapiens 


ppr Q 


1 no 

i\)Z 


10 


1306 


U93567 


Homo sapiens 


putative pi 50 


245 


63 


1307 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


130 


70 


1308 


AL355774 


Streptomyces 
coelicolor A3 (2) 


putative integral membrane protein 


136 


40 


1309 


W54966 


Homo sapiens 


Synthetic human type III collagen 


124 


41 



163 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








£> Y 1N-C3. 






1310 


M20670 


Plasmodium 
vivax 


circumsporozoite protein 


107 


34 


1311 


AF151366 


Arabidopsis 
thaliana 


arginine/serine-rich protein 


114 


36 


1312 


G03099 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
n i cn 


76 


43 


13 13 


OU3z_»<> 


— — : ; 

Homo sapiens 


riuman secreted protein, oJbv it) inu. 
/ joy. 


iuy 


Q< 
OO 


1314 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
Clone Jtti^iVi.tSv_J_>z. 


111 


37 


1315 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OjOO, 


123 


62 


1316 


R95913 


Homo sapiens 


Neural thread protein. 


98 


58 


i_> 1 / 


Ar 1 130o.> 


Homo sapiens 


r> T> C^iC\CiH A 

JrKAJUy /4 


1 7f» 

1 /u 


4 / 


l_>lo 


YUz / _:> 


Homo sapiens 


Human secreted protein encoded by 
gene j i clone riUisJ£.Ao_>. 


lzz 


ol 


1319 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A 

44V3. 


99 


68 


1320 


AJ242540 


Volvox carter! f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 


270 


56 


1321 


U93569 


Homo sapiens 


putative pi 50 


124 


37 


13zz 


A ITAOAO' 1 1 


Homo sapiens 


T)T> /"~\/"\ A r>--> 


iii 
111 


85 


1323 


Z" 1 f\A AO 1 


Homo sapiens 


Human secreted protein, _>bQ ID NU: 
o 1 /Z. 


85 


44 


1 'iO/l 
1_»Z4 




_____ . 

Pnaseolus 

vulgaris 


hydroxyproline-rich glycoprotein 


131 


43 




W45_01 


Homo sapiens 


Human breast cancer related protein 


1 A A 


_>4 


1326 


AF 194537 


Homo sapiens 


NAG 13 


125 


49 


1_»Z 1 


T OIAOQ 


Homo sapiens 


reverse transcriptase 


111 


4D 




U7JJUO 


jriomo sapiens 


putative pi_»u 


^ \ o 
1 1Z 




1329 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/o /U. 


110 


51 


1330 


L27428 


Homo sapiens 


reverse transcriptase 


142 


53 


1_»3 1 


vjrUlozo 


Homo sapiens 


Human secreted protein, SfcsQ ID NO. 
5909. 


A© 




133z 


ArUlU144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


1 1 o 

113 


55 


1333 


X71602 


Nicotiana 
tabacum 


extensin 


113 


35 


1334 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OO 1 A 

/Zl4. 


129 


41 


1335 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AO 1 1 

(3y 13. 


112 


50 


1336 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A Q 

443 D. 


102 


68 


1337 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
oo i a 


95 


51 


1338 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


114 


60 


i jj> 


LjUU3 iy 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4460. 


1 AO 

14Z 


£^ 
jj 




AJJUOJ 


A_y —-UpcIM- UH 

esculentum 




1 0'X 
1ZJ 


3 1 
_> i 


1341 


G02920 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7001. 


118 


70 


1342 


X71629 


Mus musculus 


msgl 


106 


57 


1343 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


105 


79 



164 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1344 


G03435 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/ J lu. 


116 


85 




AF1 fil^Sfi 






OO 




1346 


JO 1435 


Rattus 

x\ c\ r\ / o i r*i i Q 

X1V1 VCgiUUj 


ATPase 


348 


66 


1347 




xX\jxxx\j oCLyjiziixo 




142 


jo 


1348 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7R71 


87 


66 


1349 




Mrttnn c?inipn q 

JLJLLUllv odL'JLC'JLX^ 




1 S9 


Sd 

J4 


XJ -J v/ 


noo^s4 

\JVJ\JJ Jt 


± 1U11ILJ bcljJICllb 


XT-Uijicin ocuiCLcu piuicm, odv,/ aju' in \j. 

4435. 


i 


OJ 


1 1 






putative piju 




OH 


J. D-)M 




x xai>±xi U ti J ill i I 

V 1 VClA. 


pval 




70 
/U 


1353 


X61047 


Hydra sp. 


mini-collagen 


105 


36 


1 jJ't 


AF1 1 ^SdO 


JTlUillLJ bdplollb 


v v ii>ivULt~/-viurion oyiiuruiiic yx vjlciii 


1 9R 




1355 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

79 T A 


91 


39 


1356 


AF194537 


Homo sapiens 


NAG 13 


148 


63 


1 'X ^7 




Homo sapiens 




1 1Q 


Id 


1358 


X53581 


Rattus 
norvegicus 


ORF4 


208 


43 


1359 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


195 


50 


1360 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


328 


69 




W 6U4UO 


— — ; 

Homo sapiens 


A secreted protein encoded by clone 

AhACl ^ 

LLI1/+U J . 


Izo 


CO 


1362 


G02451 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


97 


47 


1363 


X53581 


Rattus 

11VJ1 VCglL/Uo 


ORF4 


110 


35 


1364 


U93569 


Homo sapiens 


putative pi 50 


123 


.41 


IJOJ 


Ar U 1 v 1 4*4- 


Homo sapiens 


neuronal thread protein AD7c-NTP 


1 14 


OJ 


1366 


X61294 


Rattus 
norvegicus 


Ll retroposon, a portion of its ORF2 
sequence 


153 


43 


1367 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene iuu cione xirNriu!70. 


69 


80 


1368 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

qi no 
51 /z. 


80 


46 


1J07 


KjU^-tUO r 


nomo sapiens 


jLiunian secreieu protein, ollk^ ilj lvkj. 
8148. 


it? 


zo 


1 ^70 


A7Z.T-OJ 


JT lobil 1 U 111 111 11 

vivax 


pval 


JLUO 


AC. 


1371 


U90946 


Dictyostelium 

LilO^WlLiC Ulll 


myosin heavy chain kinase B 


114 


62 


1372 


T 97498. 




IC/VCloC Li dllSL/I 1 L) LdbC 




^1 

u 1 


1373 


G03800 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


80 


1^74 


T T4QQ74 


xxKJiHsJ bcipiollb 


lllelllllCI UdllDpvJba.bC 


1 ^7 


^7 


1^7^ 
ij / j 




XlOIIlO bdpieilb 


\JL JLL-'.g X O-J i i X\J J 


7^A 


^R 

JO 


1376 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


132 


60 


1377 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


317 


51 


1378 


AF1 18082 


Homo sapiens 


PRO 1902 


102 


42 


1379 


AF 115549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


186 


45 


1380 


U93567 


Homo sapiens 


putative pi 50 


116 


38 


1381 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


218 


47 



165 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








alJXJUlal l\j pugO CIolIlCill 








D\J€>y 1 o 


jnomu bapicub 


ri inn aii seL/rexeu proieni sccjucuLe 


87 


oo 


1383 


AF090895 


Homo sapiens 


PRO0117 


68 


82 




i_j y J ~j 1 w 




pLttCiLlVC pi»7L/ 


1 78 

I/O 


oy 


1385 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7914 


132 


43 


1386 


AF130089 


Homo sapiens 


PRO2550 


142 


35 


loo / 


T 9749 R 


XXUIIivJ aapiCIli> 


reverse transcriptase 


1 

1 Oj 


AO 




AOlZr70 


IvaLLuo 


open reading frame 2 


1 9^ 
IZj 


AA 


11&0 


Hfi9£79 

UUZo /Z 


rxorriu bapienD 


jtiuinan secrcLea proiein, iu inv_/. 
6953. 


1 Jo 


/^7 


1 ^00 


V09A71 
X UZO / 1 


riorno sapiens 


numan secrexea proLcm encoaea oy 

optip 99 rlnnp T-HVTS TAX/1 % 
gciic uiuxie jnixvxo j vv x o. 


1^1 / 


3 / 


1391 


U93570 


Homo sapiens 


putative pi 50 


110 


28 


1 ^09 


rvyjy i 3 


noiiio Sapiens 


iNeurai inreaci proiein. 


1 U^f 


3D 


1393 


L27428 


Homo sapiens 


reverse transcriptase 


101 


39 




W 5U4UO 


Homo sapiens 


A secreted protein encoded by clone 

Hfi40 ^ 

OU'H-U 3 . 


14 / 


4 / 


1395 


AF2 16972 


Homo sapiens 


p8 protein 


118 


49 




U4yy /j 


Homo sapiens 


utvr i 3 MxiKJ /, putative transposase 
similar to pogo element 


loU 


£ o 
JO 


ljy / 


T T/1 DOT? 

U4yy /j 


Homo sapiens 
. 


L>Kr l , JVLbKj /, putative transposase 
similar to pogo element 


in 


/CO 

oz 




Hrui u 144 


Homo sapiens 


neuronal tnreaQ protein /vl* /c-in i Jr 


y / 


oz 


1 "300 
uyy 


ljuj iy\j 


rfomo sapiens 


rmni cut secrexea protein, oiz,^ iu rsKj. 
787 1. ^ ^ 


IOC 

1ZZ> 


<A 




.A.i/Z'H'O J 


X I dSil 1 U Ul UJ Jl 

vivax 


pval 


1 9d 




1401 


T TO'} 

LJ 7JJUJ 


rruixnj SapiCXIci 


puidliVC pi *J\J 


ij i 




1402 


X67863 


Mus museums 


T2 


160 


48 


XHVD 


1^09 ^7£ 


Homo sapiens 


salivary proline-rich protein 1 


QA 


oy 




UUo / 14 


Homo sapiens 


Human secreted protein, SEQ ID NOr 

770^ 


o4 


/ 3 


1 A0^ 


onn^^4 

vJUU->J*t 


riorno sapiens 


xiuiTiciii seLrereu pruiein, od^ iu 


7 1 


/ 1 


1406 


Ml 8094 


Phaseolus 


hydroxyproline-rich glycoprotein 


138 


31 


1407 


AF 1 343 04 


Homo sapiens 


Scar2 


118 


40 


1 A0R 


V0R0£1 
x UoUO I 


riomo sapiens 


juLuiiian c-ixiyo protein iragment. 


191 
1Z I 


Q9 
oZ 


14D0 


T JO"357A 




mitizti\7£* r> 1 
putaLIve piju 


1 70 




1410 


1593563 


Homo sapiens 


putative pi 50 


98 


43 


1/111 
i*+i i 


WAft^l 

WH03J 1 


riorno sapiens 


Fturriau ureasi cancer r eidteu proiein 
BCRB2. 


1 OA 
1Z^ 




1419 

1H- 1Z 




XiUIllO Sapiens 


nurndii secreteu prucem, oJC/^j ilj invj. 
7409 


1 AO 


JO 


141 3 


A F0 1 0 1 44 


nAJiiivJ sapiens 


llCUIUlldl UllCdU piOLCXll r\l J 1 V^-IN 1 x 




AR 


1414 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

S^97 


155 


82 


1415 


G0^967 


XiUJiJU Sapiens 


TTumpiTi cA<^i*p»i"f i rl i^Tr\'l"fi'tTi Q"Pr~^ Tl"^ "M^"^' 

XJ.Lilllail jCU CICU piULCilij OJ3v^ l>)v^. 

7348. 


09 

"Z 


^9 


1416 


AF119855 


1-Tfimn iar>{p , n < ; 


PRO 1847 


89 


70 


1417 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


130 


34 


1418 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


99 


39 


1419 


AF130079 


Homo sapiens 


PR02852 


114 


69 


1420 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


133 


52 



166 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1421 


R13556 


Homo sapiens 


Protein encoded downstream of hhcJVI 


115 


44 


1422 


G03786 


Hnmo <i aniens 


Human ^rrptf^rl nrfrfeMn SFO TO NO* 

7867. 


111 


77 


1423 


R95913 


Homo sapiens 


Neural thread protein. 


128 


80 


1424 


L26953 


Homo sapiens 


chromosomal protein 


104 


34 


1425 


U83280 


Leishmania 
donovani 


39 kDa antigen 


105 


51 


1426 


Y 19684 


Homo sapiens 


SEQ ID NO 402 from W09922243. 


98 


75 


1427 


U83303 


Homo sapiens 


lineal reverse transcriptase 


149 


40 


1428 


AF090895 


Homo sapiens 


PRO0117 


111 


75 


1429 


AF1 19855 


Homo saoiens 


PRO 1847 


88 


56 


1430 


AF229067 


Homo ^aniens 


PADT-H nrotein 


157 


SI 

J JL 


1431 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


509 


79 


1432 


U49973 


Pfnmn carii pn Q 

XJLKJLXXKs OupilviliJ 


Vy'AVT I 3 ivlJC-XVJ? / 3 IJUlClUvC LI aliaULfiiCl&C 

similar to pogo element 






1433 


AF01 01 44 

j\X I 1 


T-Tottia canlpriQ 

JLJLU11LU oAplCHj 


JLlCUHJllCil UJLl CdU JJIIJICJULI JrS.XJ 1 L.-1N 1 IT 


1 AQ 


S7 


1434 


AF161356 


Homo sapiens 


HSPC093 


180 


46 


1435 


TTQ1570 


XTX \JlllKJ aclLJlCllo 


JLytllctLl VC pi JU 


1 1 fs. 

x 1 O 




I'tJU 


O017R0 


T-Tfvm n c nr^i p»n c 
XJLUIILvJ odjJlCilo 


numaii bccrcicQ protein, oij/V,/ iu inu. 
7870. 


1/1 


/ D 


1437 


U83280 


Leishmania 

LLvJi IV/ VCH11 


39 kDa antigen 


106 


80 


1438 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874 


103 


73 


1410 


TT1 5647 


TV/Inc mucfMiluc 

1YJ,U£» illU-oOUlUb 


ICVCI I)C li ClllolsJL ipictbC 




AA 


1440 


L27428 


Homo sapiens 


reverse transcriptase 


78 


40 


1441 

LH*-t JL 


voiaoo 


numu bctpiciio 


v^/^rof ^>rl i -f*i r*f\ /~\ a/i T-vw Aran 1 O 

oewre ecu protein encoQea oy gene 15 
clone HNHF029. 


1 1 c 

1 1 J 


DO 




\J\JDO\J I 


nomo Sapienb 


riuman secreteQ protein, oriv^ iu invj 1 . 
7888. 


77 


03 


1443 






tAxil\j.lKjyvll Lsl islCXil 


1 J j 


i^7 


1444 


M11901 


Rattus 

J.HJ1 VCglL/U.5> 


proline-rich salivary protein 


102 


40 


1445 


Y02999 


Homo sapiens 


Fragment of human secreted protein 

p> Vi r~ o rf f=» H W\r nr<atip> 171 
C11L-VJU.CU. Uy gCllC XjLX, 


101 


66 


1446 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OJ70. 


125 


74 


1447 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OJOl . 


90 


66 


1448 


X7670R 


T^rncrvriVi il a 
X-Ji. wo vjL'ii i vet 

melanogaster 


Jpi^JtCUl J j bpCvll.IL CAOllb 


1 7^ 




144Q 




X lVJIllv/ acipiCUo 


ilUllldll bcUlCLCLl piULcHl XTV_/irir^P Do, 

SEQ ID NO: 163. 


1 7& 


70 




G040Q1 


JL J.U11J.V/ adjpldla 


XlUlllCUl bCLJCLCU piULCill, OJZ/V^J XLJ 1NU. 

8172. 


1 no 




1451 


S80119 


Rattus sp. 


reverse transcriptase homolog 


115 _j 


54 


145? 




XXUllLtJ adjylCliO 


JlUfflaH isCLJTcLCU prt/Ceiil, xjLjK,/ IU iyKJ. 

7795. 


7A 


Oj 


1453 


U42471 


TVhis mil sculus 


AVi<iP.ott- A IHrifVh Svnrlrnmp nrntpin 

YV lovWlL /xlvlx 1.^<1X O jf JXul UillC piwLCJJ.1 

homolog 


1 05 


40 


1454 


AK024455 


T Tnm n Rfmipn*? 


FT TO 0047 nrotein 




J j 


1455 


AC007258 


Arabidopsis 
thaliana 


Hypothetical protein 


105 


37 


1456 


AF194537 


Homo sapiens 


NAG 13 


208 


52 


1457 


U63542 


Homo sapiens 


FAP protein 


111 


84 


1458 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


111 


65 


1459 


AF090931 


Homo sapiens 


PRO0483 


84 


43 



167 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1460 


R95913 


Homo sapiens 


Neural thread protein. 


106 


69 


1461 


R95913 


T-frvtno ^Pinifin^ 

■LHJlliV/ OapiOlIO 


TsTf^lir^l tVirps^rl n rot pi n 

J.^v>U.l<lJ UUCCIU jJlULCIli. 


100 

i \jy 


40 

<4*\J 


1462 


U93564 


T-Tomo aniens 


Tentative r> 1 50 




49 


1463 


AB029309 


Homo saniens 


TsTow^ R-H 5 nrl in u nrnf Pin ISJnwT-iP 
iijjvv jo \j 11 1 vj in si ui vjlo m i^pvvi_JA 


97 


^7 


1464 


U44838 


Glycine max 


extensin 


97 


33 


1465 


AT 050341 


Homo , s?inipn«; 

AXvFlllvJ OCipiv/liO 


UJ J 7vJZ-x6. i Cell 1 dllgCU Lj~lHy\j HlolLUl 

sequence (ZN-15 related zinc finger 

JpA v/ IVriAl y 


191 


45 


1466 


L27428 


Homo sapiens 


reverse transcriptase 


94 


34 


i^rL/ / 


AF01 0144- 


T-Tomo QPir*>i p~n Q 

JlLKsIIHJ OCljJllslLj 


JlCul CilaJ Ifli Ca.il piUlClH r\LJ / t 1 x 1 JT 


1 1 9 
Lit 


JO 




V17991 

1 1 t Z.Z. L 


mjnnj octpic±ii> 


n.uiiit±ii scL/itJitJu prc/Lcixi ^oioxic ikji /- 
3). 


OS? 




146Q 


oo^70£ 


Hatha c nnj pn c 
snvjiii u» <»cipicii£> 


XJLlllIla.il i>t?L,r cieU prOLcIII, OJCv^ IJL^ 

7879. 






1470 


00^700 




XiULlIIaU iCUIClCCI piULCHlj OJL-V^ 1J_> IN^. 

787 1. 


1 An 

14/ 


0 / 


1471 


AF1 00007 


iTVJl I lU bcipiClia 


<s1 ^4 






1472 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 


142 


36 


147^ 


VR694R 

X ODZno 


XXUIIIU odpioHb 


riurndii secreiea protein Jti^riJrrOo, 


1 ^0 

1 jZ 


yy) 


1474 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

79 1 4 


157 


45 


1475 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 

olillllal isj pugu ClClllCUl 


398 


58 


1476 


U49974 


XXVSJllU JuUlClJo 


lllctl 111C1 11 a.1 1J> UUoalbC 


90 1 




1 477 


T T40074 


L L\Jiii\J adpiciio 


iiicti nici li aiispvJaabc 


90A 
ZOO 




1478 


W48351 


Homo sapiens 


Human breast cancer related protein 

RPR R9 


117 


72 


1479 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
79 5^5 


146 


82 


1480 


U93567 


Homo sapiens 


putative p i 50 


202 


42 


1481 


T<T09^76 


rxvniitJ ocipiOllo 


bcuivcuy proime-ricn proiem i 


1 ni 


4o 


14S9 


T TK7607 


~R affile 


JpULctLlVC 1VLN7 L V UlllUHlg piOLClll 1 


1 AO 

J.UU 


J / 


1483 






rr 1 1 m ^ n s?F»prpff»f1 r>rrvff*in ^TnT^i TT"^ TsTf^V 

7871. 


1 94 


7S 


1484 


G00328 


T-fnmn ^artiftn<; 

llUUiU OCipiv/J.io 


T-Tiim?»n Qprrp+prl rtrntpin QPT^> TT^ "KT^* 

JLXU.I11CUJ piwLCUl, OJCiV,/ 1 J_y 1NV-/. 

4409. 


1 OR 
1 uo 


n9 


1485 


U15647 


Mus museums 


reverse transcriptase 


115 


73 


1486 


AF194537 


Homo sapiens 


NAG 13 


132 


42 


14R7 


yi i 


lV/l 1 1 0 m nc r*i 1111c 


proime-ricn sail vary proiem 


1 1 0 
1 1 0 


4U 


1 ARR 


nn^ 1 i*> 

UUJ 1 J>-3 


JTlvXllivJ bcipiCllb 


rxuinaii secreieQ proxem, ojz.^ ilj lskj. 
7214. 


1 no 


04 


1 4RQ 


r\r UU7OO0 


Hi 111 Lip 1C 

sclerosis 

nQcor-ipfFr! 

CLJ j KJ Kj 1 CI I ti «-l 

rfttrnviriK 


puiyproLeiri 


1 in 


/I Q 


1490 


AK 023 542 


l-Tonnn ^anip.riQ 
x i.wiiiVj' aapiv/iia 


nnnsin^H t"vr/vfrf»in ofrvrliift 

U-i J 1 1 Cl\ V 1CU. pUJLClll pi KJ LI LIU L 


114 


^7 


1491 


G03786 


xx\jiuvj» jcipiv^iio 


T-Tiimfm Qpprpfprl rtrof pin ^IPO TT^ ~KIY~V 
XlU.llla.ll auuiCL^U pUJtOlllj ODy LxJ JlNV-J. 

7867. 


1 


50 


1492 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


184 


47 


1493 


Y12713 


Mus museums 


Pro-Pol- dUTPase polyprotein 


222 


53 


1494 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


94 


50 


1495 


AF109907 


Homo sapiens 


S164 


259 


45 


1496 


R95913 


Homo sapiens 


Neural thread protein. 


110 


51 


1497 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


299 


80 



168 



WO 01/64835 
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SEQI1> 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


14yd 


OUjSZDd 


Homo sapiens 


fiuman secreteu protein, oxi^ iu rsw. 
7339. 




65 






Homo sapiens 


chromosomal protein 




0/ 


1500 


AF090895 


Homo sapiens 


PRO0117 


145 


68 


1DU1 


T TO^ ^T9 

uyjD /z 


Homo sapiens 


p4U 


1 Id 


42 


1502 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

S2 1 TO 
O 1 /Z. 


133 


50 




kjVZjv) ! 


— — ; 

Homo sapiens 


Jriuman secreted protein, dca^ NV. 
6588. 


143 


56 


1DU4 


OUUdZo 


Homo sapiens 


xiuman secreted protein, oJca^ il) INO. 
4409. 


lift 
1 19 


69 




WUUoJo 


Homo sapiens 


Tumour necrosis factor-related gene 
prouuci. 


1 A< 


5z 


1506 


AF 109907 


Homo sapiens 


S164 


184 


43 


1DU / 


T T/1QQ7'3 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


TOO 

Zoo 


O / 


1 JUO 


JtvljjjO 


-— : : 

Homo sapiens 


Protein encoded downstream of hlic M 
oncoprotein. 


1J 1 






AJtvUUOZ^l 


Homo sapiens 


unnamed protein product 


lo / 


/Z 


1 CIA 

idiu 


OUZD3Z 


Homo sapiens 


Human secreted protein, ol^v^ ID NO. 
6613. 


1 j3 


60 


i <i i 


QA'7Q9Q 

c>ozyzo 


Homo sapiens 


PRB1M protein precursor 


1~> / 


3y 


1D1Z 


AdUIZZZd 


Canis familiaris 


UKrz 


1 10 


40 


1513 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/ooo. 


109 


59 


1514 


AF220264 


Homo sapiens 


MOST-1 


108 


80 


1515 


X53581 


Rattus 
norvegicus 


ORF4 


96 


44 


1516 


V00662 


Homo sapiens 


cytochrome oxidase III 


433 


74 


ID 1 / 


t T/inmi 

U4yy 13 


Homo sapiens 


(JKr 1; JvlbKi /; putative transposase 
similar to pogo element 


263 


70 


ID lo 




Homo sapiens 


ORF1; MJEK37; putative transposase 
similar to pogo element 


o zr zr 

266 


70 


1519 


M24732 


Homo sapiens 


lamin-like protein 


107 


44 


i con 
1DZU 


UU3 133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


131 


51 


IDZ1 


W4o3Dl 


Homo sapiens 


Human breast cancer related protein 

rSL^ivtSZ. 


O 1 

ol 


68 


1 DZZ 




xiomo sapiens 


"NT a n 1 1 
IN AO 1 d 


1 lo 


4Z 


1DZD 


A:?Z4oD 


Plasmodium 
vivax 


pval 


Q< 
OJ 


/lO 

4Z 


1524 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
si 79 


125 


50 




AR04179R 


±1\J1A1\J DcipiCllj 


VJ pi L/LCJIJ-L'UUpiCU 1 CUCpiUl X VjXv" I 


zzu 


1 on 


1526 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


138 


70 


1 o"97 

1 DZ 1 


UJZU/ / 


nomo Sapiens 


mariner transposase 


nil 


DO 


1528 


L27428 


Homo sapiens 


reverse transcriptase 


189 


40 


iDzy 


I ODJ /J 


Homo sapiens 


, Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


355 


no 

7o 


IDdU 


T T3^1f\ 


IVIus musculus 


IFN-response element binding factor 2 


yi) 


j/ 




OUUiD4 


Homo sapiens 


Human secreted protein, oJbQ iU NU: 

44 3 


90 


64 




TT1 19X8 


Lsl U&Vjpiil lei 

melanogaster 


alapnaiious protem 


lJO 


JO 


1533 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


429 


63 


1534 


L27428 


Homo sapiens 


reverse transcriptase 


249 


55 


1535 


U93570 


Homo sapiens 


putative pi 50 


114 


31 


1536 


AF130089 


Homo sapiens 


PRO2550 


111 


47 



169 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Idetitit 










Score 


y 


1537 


X55687 


Lycopersicon 


extensin (class II) 


s- o 


28 






esculentum 








t c o o 

1538 


T T 1 C H A 1 "! 

U 15647 


Mus musculus 


reverse transcriptase 


1 10 


42 




A1VI/Z4455 


Homo sapiens 


"T?T TA AA/f7 H u A ^, — . " u 

rLJi)0{)47 protein 


lOA 

139 


80 


1540 


003062 


Homo sapiens 


Human secreted protein, SBQ ID NO: 


128 


48 








»71 jIO 

7143. 






1541 


AC024788 


Gaenorhabditis 


Hypothetical protein Y46E12A.d 


80 


46 






elegans 








154Z 


W48351 


Homo sapiens 


Human breast cancer related protein 


"t o o 

13Z 


73 








X5CKJ3Z. 






1545 


WDUlyJ. 


Homo sapiens 


Amino acid sequence of salivary 


OA 

90 


o o 








protein CON- 1 . 






1544 


LrUzy / 1 


Homo sapiens 


Human secreted protein, MiQ iju> NU: 


/ 1 


63 








/lOZ. 










Homo sapiens 


Neural thread protein. 


1 1 $ 
IIS 


55 


1546 


Dool 16 


Pan paniscus 


cytochrome c oxidase subunit 1 


218 


78 


154 / 


Uioi 1Z 


Homo sapiens 


cytochrome c oxidase subunit 1 


^70 


71 


1548 


W4835 1 


Homo sapiens 


Human breast cancer related protein 


1 19 


65 








BCK±>Z. 






1 C A A 

l54y 


A TA f\A O 1 A 


Zea mays 


cytochrome P450 monooxygenase 


1 A A 

140 


70 


1550 


AF1 13685 


Homo sapiens 


PRO0974 


115 


47 


1551 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


172 


75 








SEQ ID NO: 163. 






1552 


Y91577 


Homo sapiens 


Human secreted protein sequence 


366 


94 








encoded by gene z ShQ ID N 0:250. 






1553 


W/A H1 1 Q 

W4U1 1 J 


Homo sapiens 


Human alpha-2(IV) collagen protein. 


i n 

1 17 


TA 

59 


1554 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


76 








a a r\o 

4493. 






1555 


U 42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 


1 13 


40 








homolog 






1556 


D381 1j> 


Pan troglodytes 


cytochrome c oxidase subunit 1 


502 


89 


1557 


AF130089 


Homo sapiens 


TVTi /~i<o c c r\ 

PRO2550 


98 


55 


1 c c o 

1558 


Y08062 


Homo sapiens 


t t t~»t» /*^io a tr _ J ■ , 

Human PR0245 protein fragment 


109 


43 








derived from DNA35638, 






1559 


U49973 


Homo sapiens 


/*vi%r<i ik ■ri - »T^o'0^ . . ■ , 

ORF1; MER37; putative transposase 


183 


52 








similar to pogo element 






lDOU 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


180 


52 








similar to pogo element 






15o 1 


Arz0z&93 


Mus musculus 


TV" I -P^ 1 "U 

Kiiz 1 b 


254 


85 


1 562 


M634Z1 


Drosophila 


csp32 


104 


39 






rneianogasier 








1505 




Homo sapiens 


Human secreted protein, SEQ ID NO: 


129 


69 














1D04 




— ; 

Homo sapiens 


UKr 1 , Mlito /, puiative transposase 


AO 

yo 


5y 








similar to pogo element 






1 565 


A T1 <JO 1 A/C 

AJ15Z1U6 


Bos taurus 


bCO-spondm 


1 14 


A r\ 

40 


IDOO 


AjLoyUl 14 


Leishmania 


extremely cysteine/valine rich protein 


1 1 A 

1 iv 


66 






major 








1 JO / 


Ar 1O1350 


Homo sapiens 




1 AA 

100 


38 


150o 


ATI 1 QOC1 

Ar 1 lyi>51 


Homo sapiens 


TVr> r~\ 1 lOI 

JrKUl /ZZ 


QA 

94 


ha 

11 


1569 


L27428 


Homo sapiens 


reverse transcriptase 


107 


48 


1570 


X99451 


Lycopersicon 


extensin-like protein DiflO 


104 


32 






esculentum 








1571 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 


126 


59 








clone HNHF029. 






1572 


X7348I 


Drosophila hydei 


mstl01(2) 


105 


41 


1573 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


133 


50 






elegans 


binding protein K10 (NID:g8148) 






1574 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


154 


51 



170 



WO 01/64835 
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SEQ 10 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








© 1 A A 

8144. 






1_> /D 


r»i oi n 
Dioi iZ 


Homo sapiens 


NADH dehydrogenase subunit 4 


323 


83 


1576 


AF062008 


Caenorhabditis 
elegans 


unknown 


111 


54 


1577 


X924S5 


Plasmodium 
vivax 


pval 


81 


57 


1578 


U93570 


Homo sapiens 


p40 


102 


33 


1579 


Ar 1)90944 


Homo sapiens 


TVT> AA/T ^'J 

PRO0663 


132 


59 


1580 


AL 137798 


Homo sapiens 


dJl 182A14.5.1 (novel gene (isoform 
1)) 


182 


53 


1581 


X92485 


Plasmodium 
vivax 


pval 


120 


44 


1582 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


160 


47 


1583 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


135 


54 


1584 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


119 


52 


1585 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


82 


55 


1586 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

ir\n c 
70 ID. 


152 


60 


1587 


D28482 


Homo sapiens 


SCR2 


390 


83 


1588 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


122 


59 


1 con 

1589 


A T? A 1 A 1 A A 

AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


117 


50 


1590 


AF1 18078 


Homo sapiens 


PRO 1848 


118 


59 


1 C A 1 

1591 


CjrUU9o5 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5066. 


56 


78 


1d9z 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/ 1^4. 


118 


80 


1593 


U93571 


Homo sapiens 


p40 


170 


77 


094 


Y YZ / 13 


Mus musculus 


rro-Pol-dU 1 Pase polyprotem 


230 


51 


159j 


L2 /42o 


Homo sapiens 


reverse transcriptase 


138 


45 


1596 


X97707 


Pongo pygmaeus 
abeln 


stopcodon created by 
posttranscriptional polyadenylation 


139 


61 


159 / 




Homo sapiens 


COL1A1 andPDGJbB fusion transcript 


107 


31 


159o 


002480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


97 


58 


1599 


AF2 10651 


Homo sapiens 


NAG 18 


86 


89 


1600 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


148 


48 


1601 


J03770 


Mus musculus 


homeobox protein 


99 


35 


1602 


AF1 19901 


Homo sapiens 


PR02831 


119 


56 


1603 


AL031673 


Homo sapiens 


dJ694B14.1 (PUTATIVE novel KRAB 
box protein with 1 8 C2H2 type Zinc 
finger domains) 


233 


44 


loU4 


kjkjd / 14 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


66 






Homo sapiens 


Human secreted protein, SbQ ID NO: 

11 A Q 


89 


44 


ioUO 


L/oo4o 1 


— — 

Rattus rattus 


JN-WAbr 


1 T5 

12j 


A U 

43 


loU / 


Ar uyuy4.z 


Homo sapiens 


T>T> r\ A/C<T 


107 


61 






lYius muscuius 


jerky 


lVD 


J4 


1609 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


108 


45 


1610 


B06334 


Homo sapiens 


Human subtilisin-kexin isoenzyme 1 . 


474 


84 


1611 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


101 


90 


1612 


AK024455 


Homo sapiens 


FLJ00047 protein 


83 


55 


1613 


D86853 


Catharanthus 


extensin 


123 


39 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






roseus 








1VJ It 


AF1 19R51 

-TT.1 1 l/OJl 


T-Tnmn Q?inipn ^ 
iiunnj aapiciij 


PR0 1 777 


1 

1 DJ 


jy 


1615 


Y01 158 


1-Tnmo ^finien 5 * 

JLIUiiiv □aplvxii> 


clone HCAC J8 1 . 


147 


^7 


1616 


AF1 94537 


Homo sapiens 


NAG 13 


154 


63 


1617 


AF1 19851 


Ffomo ^aniens 


PRO 172? * 


y l 




1618 


W48351 


Homo sapiens 


Tinman hrp^^t nan^pr rplatprl rvrrvfpin 

BCRB2. 


97 


4R 

Ho 


1619 


AK024455 


Hfimo ^aniens 


FLJ00047 nrotein 


1 47 




1620 


AF2 17973 


Homo sapiens 


unknown 


116 


67 


1671 


AF000798 


f 1 si p n fM*1i 51 \> H 1 + 1 c 
v^acinjj iiauuitio 

elegans 


w cats, oiiiijiai liy t»uiia^cxii>, giycine- 
and proline-rich 


1 fk7 
1 OZ 


HI 


1677 




XJ.UII1U odpiCIlo 


sd.li vary proime-ricn protein i 




An 


1623 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

81 4£ 


135 


44 


1624 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
siiiiiiar to pugo ciemem 


322 


62 


1 67^ 

1DZJ 


T T4997^ 


nomu sapient> 


v^jtvr i , iviiz/i\j) / , puiative rransposase 

cim liar fr\ rtr\cxr\ p>l/3mp>r>f" 
oJUlllIa.1 l\J jJUgU C1C1L1C11L 




/Z 


1626 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 

UJLlL>UjJJ v/LClll. 


113 


65 \ 


1627 


X92485 


Plasmodium 
vivax 


pval 


90 


45 


1628 


AF090RQS 


T-Trimrw oanif^riQ 
-Tiu J. isdjJldlo 








1629 


AF1 16661 


Homo sapiens 


PR01438 


87 


54 


1 610 
IDjU 


1V1 1 O 1 uu 


11U1 VCgJLL/U.Jj 


unknown protein 




7o 


1631 






rxLuiiaii scucicu piutcni, OJ3 LLJ iNv. 

7284. 


1 47 

1 H / 


i^n 
ou 


1632 


AF119851 


Homo sapiens 


PRO 1722 


107 


70 


1633 


M647Q7 


n orvegi cus 


Qfil i\/?it^/ i^rPilin P'~vir > l> t^rr^tf^iTi 
oaiivcuy jjj uimc-i piULClll 






1634 


L27428 


Homo sapiens 


reverse transcriptase 


109 


38 


1635 


AFT 1 R0R7 




PT*m 007 




H\J 


1636 


R95913 


Homo sapiens 


Neural thread protein. 


118 


88 




no74oo 


JCiUlllU bdpiCIlb 


j^uiiid.ri seurexeu protein, otiv,^ iu invj. 

DJ / 1 . 


fy 


tin 


1638 


U93570 


Homo sapiens 


putative pi 50 


128 


54 


16^0 




JuLomo sapiens 


ORP1 j MER37j putative transposase 
similar to pogo element 


iZ / 






TT4997^ 


JT1UI11U odJJlCllo 


v/j\r i , ivi£jx\.j j , puidu ve Transposase 

similar Trv nocrr> pi ptti p»r>t 


1 7 1 
1/1 


DO 


1641 


AF194537 


Homo sapiens 


NAG 13 


140 


66 


1647 






Jl. Ulllall SCU1CLCQ pit/LCllI, oHV^ 1 1 J lyyj. 

6574. 


1U 1 


^o 


1643 


M64793 


Rattus 

rmrv6*crteu<? 

A1VJ-1 V V/giV/ Uo 


salivary proline-rich protein 


117 


33 


1644 


AF010144 


TTr>mr» sarjipn^ 


t rnn^ ] tKrpJiH nrntf^in A 07r*_7\TT^P 
iJtcuiuiiai till cau r\_LS f t IN ir 


1 7^ 
i^j 




1645 


G03787 


Homo "^aniens 


Human «iPcretpH nrotpin ^FO TT~> "MO* 
7868. 


96 


70 


1646 


L27428 


Homo sapiens 


reverse transcriptase 


86 


84 


1647 


X92485 


T*1 a tLTn nH inrri 
A icimiiuu.iliiii 

vivax 


pval 


1 ^7 


H\J 


1648 


U15647 


Mus musculus 


reverse transcriptase 


93 


68 


1649 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


131 


41 


1650 


AF1 16712 


Homo sapiens 


PR02738 


107 


57 


1651 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


139 


72 . 


1652 


X05472 


Rattus 


ORF3 


84 


51 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






norvegicus 








1UJJ 


G0058£ 


TTr» mo c^TT-iRn <? 

XXUilHJ' ia.piC/liO 


XlUIifCUi iCU UlCU piULdllj OJUrV^ iJL/ iNV-/. 

4669. 


177 


/ 1 


1654 


U93566 


Homo sapiens 


p40 


117 


52 


1655 


/TLX jL, i / *JJ\J 


A XUllJlU oCtl-JLvsllO 


11 UllL/dtV^VJ. lilC V cLl.\JliaL\J iS.lllCloC 


141 


7fi 


1656 


AF090895 


Homo sapiens 


PRO0117 


125 


60 


16S7 

xOJ / 




PI p cm r\ r! ii i tn 
AlaMllvJUlUlll 

vivax 


pval 


1 1 A 
1 l^f 




16SR 




XxOlliO oapieiib 


neuroncii uireao. protein / c-in i jt 


1 Q7 


ol 


1659 


Y86473 


Homo sapiens 


Human gene 52-encoded protein 

xTdgllieill., oxl\^ IxJ 1 lNv_^.Joo. 


73 


30 


1DDU 


V01 S77 

i y 1 J / / 


riomo bapienb 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


-5 J i 


/4 


1661 


VJUJ't-JO 


riomo bapicnb 


xiuman secretea proiem 3 oxiv,; id nkj. 
7S1 0 

/J17. 


LZ f 




1662 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
biiniioi lo pogo clement 


312 


67 


166^ 


OQA1 10 


iVaiLub o\J. 


revcrbc iraiibcripiabe nomoiog 


oo 


jy 


1664 


U43360 


Peromyscus 

■m om Iodic 

ill all 1 tllat Ub 


reverse transcriptase 


106 


45 


1665 


M76729 


Homo sapiens 


pro-alpha- 1 type V collagen 


172 


47 


J.OOO 


W4ojj 1 


Homo sapiens 


Human breast cancer related protein 


y / 


54 


1667 


AF 1693 88 


Mus musculus 


alpha 4 collagen IV 


84 


38 


lOOO 


VT 7/1 

W4o3j1 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


84 


66 


1 660 

1007 




Homo sapiens 


ORF1; JV1ER37; putative transposase 

Oirvnicn 4 ir\ Y\r\rrt~\ <al*arr^<2i"r»+ 
blUiiicll LO pOg,0 CiClilCIlL 


4Zd 


OA 


1670 

ID l\J 


MT7Q677 


IV/Ttic miier , nlnp 
IVlUa IIIUoULllUb 


open reduing irdme a 


/4 


40 


1671 


W90838 


Homo sapiens 


Human lymphocyte targeted peptide 


98 


100 


1677 




nuuKJ sctpiciii 


nuriiaii bccreieu protein, oxiv^ id l\\J. 
4400 




JO 


1673 


AF090931 


Homo sapiens 


PRO0483 


72 


39 


1674 


7&7 


±HJilHJ aa.pi&lL£} 


UlapilctOOLii> 1 




ACk 


1675 


U J / JOl 


XVCllLU.J> 

n ofVf^oi pii q 

11LM VCglOUo 


COlia^Cll yVll d.ijJlld. 1 




4o 


1676 


AF1 82844 


Horn o QaniprK 


VPS7R nrotein 

V I l_>^1<€j V/ LC 111 






1677 


L27428 




ItVClOv 11 CUlOwi ipiadv 


1 SO 

1 o!7 


47 


1678 


VJ V_7 J _J .Z, 


T-Tnmo QanipnQ 

11U111U ijClLJlVjlXO 


Hfnman cppr^tpH nmtp'iTi QTn O TT^) "MO* 

ATLU.111CIH SCtlCLCU UlUlCUl, OIjV^ XJ--' 1NV_^. 

6613. 


1/17 




1679 


U93565 


T-f nm o camipric 


Tvntativp "1*1 1 ^>0 


71zt 




1680 


AK00^129 


I-fomo canten*; 

llVlllv ijClLylwlAO 


nnnflttipH rimtpin rvroH nrt 

UAlllClllieiJ. piUlwlli pi UUUUL 


17S 

1Z.O 


S7 


1681 


X03145 


Homo sapiens 


pot. ORF V 


93 


48 


1682 




A/TllQ TYlUQfll 111*1 
IVlUo UlLloU LI I Lib 


nrrvliTif*— t*ir*Vi YvtvYhf^in 

pi wllllC - 1 It/Il pi \J UClll 


1 0S 


JO 


1683 


AF 11 8082 


Homo sapiens 


PRO 1902 


117 


42 


1D04 


T? 1 3SS6 

JtVl D J JO 


riomo bapicllb 


x roiein enoou.eti u.o wnsueam 01 nnc ivi 

\JllUUpi \J LC ill . 


1 0 / 


JO 


1685 


R95913 


Homo sapiens 


Neural thread protein. 


92 


66 


16fl6 


n094SS 


Wnmn caniVnc 
jnUllHJ aopiCilb 


xxuiiiaii aecrcLeu pioi.eiii, onk^ IjL/ lNi-*. 

6566. 


hi 


£1 

01 


1687 


X61296 


Rattus 
norvegicus 


open reading frame 2 


104 


38 


1688 


AB012223 


Canis familiaris 


ORF2 


98 


39 


1689 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


278 


72 


1690 


U52077 


Homo sapiens 


mariner transposase 


175 


56 


1691 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


83 


46 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1699 


AF061 198 


x xctaixl UUI UX1I 

falciparum 


tnprr>7Al c l ! vFq r* /a r>T"/~»i"/airi 1 
ILiau^UllC bUrxaCt; piv/tcxxi I 


oj 


A A 


IvyJ 


X77799 




1I1LCI Xci Oil dxpncl/ Dcid IcLpcptUX 


oy 




1694 


Ml 3 100 

1VX I J 1 VyL/ 


norvegicus 


uiiisjivvvii pioicm 


04 

y** 


/in 
4U 


1695 


AF90963S 




PP1900 

X X 1 jL*\J\J 


114 

1 IS- 


ou 


1696 


AK001116 


Homo sapiens 


unnamed protein product 


127 


53 


1 607 
VOy / 




nomo sdpienb 


Human secreted protein, SEQ ID NO: 

44X4 




5^ 


1 698 


AF1 1 R07R 


rxixiixu acijjiciio 


PRH1 84R 
jt iwy i c+o 


03 

yj 


4Z 


1699 


X92485 


Plasmodium 

vxva/v. 


pval 


147 


49 


1700 

X / UVJ 


M63R1Q 


Jr laMIlUlilUliX 

falciparum 


iiid.id.rid. dnugen 


TO? 
xUl 


CA 
D4 


1 701 

x / u x 


AF090Q30 


numu aauiCUa 


PR O047R 




to 


HO? 






t/UUdgCil dxv,v^ 


y*\ 


Af\ 
4U 


1703 


AL390114 


Leishmania 

IlldJUI 


extremely cysteine/valine rich protein 


169 


66 


1 704 

X / \J'-r 


A F 1 30089 




r>x> O9S^0 

jtSSXJJ, u 


1 /1C 




170S 


ycaii 3 

AOJt- 1 J 


T-Ti i tyi n n 
nuindii 

Vipropcvimc 6 

11 CI LJCb V IX Ut> \J 


Uoo 


1 1 ^ 


JO 


1706 


S60088 


XXUlilvJ oajJlCllO 


pULdLlVC aClllClslUll IIUJIUCUI^ — /A VJ ivl y\ 


1 ^ 1 
i J 1 


50 


1707 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
S397 


176 


52 


1708 


AF1 30079 


xrxtjintj ociljiciio 


PR D98S9 


1 7A 
1 / 




1709 


Y28682 


Homo sapiens 


Human pp392_3 secreted protein. 


557 


99 


1710 
J. / 1 U 


IVn 4473 


Mus musculus 


pro-alplia-1 type I collagen 


1 VZ 


34 


1711 


D13623 


Rattus sp. 


p34 protein 


128 


45 


T71 7 
1 / IZ 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
8144. 


64 


A 1 

41 


1713 
I / 1 J 




Homo sapiens 


plakophilin 2b 


1Z 1 


60 


X / 14 




Homo sapiens 


Human secreted protein, SEQ ID NO: 

7^34 


1 AO 


C 1 

3 1 


1715 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
bimiiat to pogo element 


171 


48 


1716 


0003^7 


JTHJHIU adpiCllb 


rxLiiiidu bcureicu procem, ocy id lskj. 
4438. 


QQ 


OD 


1717 
x / x y 


QQ01 1Q 




rcvcrbc irdnscripLase nomoiog 




-> / 


1718 


L27428 


Homo sapiens 


reverse transcriptase 


103 


46 


1710 


(3-037Q0 


xiomo sapiens 


riuman secreiea protein, oJc/i^ lu invj. 
7871 






1720 


AF130089 


Homo sapiens 


PRO2550 


106 


35 


179 1 




xioino sdpiens 


riuman secreiea proxem, oiiv^ ru invj. 
7733. 


1/1^7 

14/ 


4^ 




H03703 


XxUlIlU odpiCllo 


numdn secreiea proiem, oiti/^ inl/. 
7784. 


111 
111 


1 Z 




A F0 16099 


A/TlIC TTII ICfMI lltC 

ivxuo iLiut>ou.iU£> 


cnuoii u t/ icdb e/ re v er sc ixdiis cnpias e 


1 39 


OU 


1724 


X92485 


Plasmodium 
vivax 


pval 


96 


62 


1795 


T T49973 


xlUJUU iidpiCUb 


v^iTvr i 3 ivx-o£w /, puidiive transposase 
sixxiiidr to pogo eiemenx 




JZ 


1796 


MS 1 391 


lvxdL>a.L/a 

fascicularis 


piuiiiic-i icii pruiem 


idZ 


A K 
43 


1727 


G02507 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6588. 


119 ; 


69 


1728 


U93564 


Homo sapiens 


p40 


129 


58 


1729 


U93574 


Homo sapiens 


putative pi 50 


113 


76 


1730 


AF130089 


Homo sapiens 


PRO2550 


136 


61 


1731 


L05608 


Cercopithecine 


glycoprotein gl 


100 


46 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 






nerpes virus z 








1 /JZ 


U 1304/ 


Mus musculus 


reverse transcriptase 


1 -» o 
1 JO 


36 


1 /33 


Uy53 /4 


Homo sapiens 


putative pi 50 


1 Q7 




I 734 


CjUJ 714 


Homo sapiens 


Human secreted protein, !SbQ ID NO. 


155 


64 








/ /yD. 






1 7Q c 
1 /JO 


i UZyyy 


— — : 

Homo sapiens 


Fragment of human secreted protein 


1 A7 


£LA 
04 








encoded by gene 121. 






1 no. £L 
1 /JO 


VAOOOQ 

lxJZyyy 


— — ; 

Homo sapiens 


Fragment of human secreted protein 


1 1 Q 


A 

64 








encoded by gene 121* 






i ion 


f~\(\r\A OA 


— — z 

Homo sapiens 


Human secreted protem, ohKl \L> JNU. 


1 1 A 

1 1U 


58 








43 / 1 . 






1 7"3R 

1 / JO 


ATrnonozt7 
/\.ruyuyHZ 


riomo sapiens 




1 OD 


cc 

JJ 


1 73Q 


r n 1 7££ 


urosopniid 


diaphanous protein 


1 AQ 

i4y 


40 






meianogasier 








1 7/1 A 
1 /4U 


T 77/175 
i-,2 /4Zo 


Homo sapiens 


reverse transcriptase 


Wo 




1 7A1 


f7A7/l£C 


Homo sapiens 


Human secreted protein^ SEQ ID NOr 


7< 


34 








ODOO. 






1 7/17 
I /HZ 


VI 771 "2 
I 12 f 1 J 




Mus musculus 


Pro-Pol- dUTPase poly protein 




3y 


1 /4J 


ACUU3o82 


Homo sapiens 


R28830_l 


179 


63 s 


17/1/1 

1 /44 


X65165 


Volvox carteri 


extensin 


173 


49 


1 T/1 C 
1 /43 


LjUU3Z5 


Homo sapiens 


Human secreted protein, SbQ ID JNU: 


1 14 


62 








4409. 






1 /4o 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


89 


41 








6720. 








Cj0j>787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 15 


42 








noco 
/005. 






1 HA Q 

1 /4o 


t too c/^c 


Homo sapiens 


putative pi 50 


IOC 

125 


37 


1 1A Q 

i /4y 


OU4U0/ 


Homo sapiens 


Human secretea protein, bxii^ ID InU. 


1 O 1 

121 


A C 

45 








Q 1 A Q 
O 14(5. 






l / jU 


i uzy yy 


— — ; 

Homo sapiens 


Fragment of human secreted protein 


1 3y 


33 








encoded by gene 121. 






1 / J 1 


A T7A 1 A1 A A 

Ar U 1 u 1 44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


o / 


OA 


info 

1752 


L27428 


Homo sapiens 


reverse transcriptase 


133 


42 


1 /33 


T TOO CTA 


Homo sapiens 


putative pi 50 


12o 


57 


1 /34 


T TOO' 1 

UZzj /O 


Homo sapiens 


alternatively spliced product using exon 


134 


50 








1 1 A 






1 TC C 


U4yy /j 


Homo sapiens 


vJJKJr I ; MxiKJ / ; putative transposase 


1 

175 


76 








similar to pogo element 






1 /JO 


AFUUOoio 


Oryza sativa 


similar to RJNG-H2 finger protein 


i i 


/TA 

60 








T> TT A Ira / A T?f\HQCQ'2\ 

JKxlAia (ArU/oooJj 






1 7C7 


T TC/179Q 
U34 /65 


— 

Mus musculus 


Wiskott-Aldrich Syndrome Protein 


1 CO 

13& 


A A 

44 


1758 


Y86579 


Homo sapiens 


Human gene 92-encoded protein 


109 


88 






_ ~ . 


tragment, oxiv^ id JNL>.4yo. 






i 7^g 


ouzyzu 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


1 T2 

I2^> 


15 








/UU1 . 






1 7£A 

I /ou 


i oOj fy 


_ . 

Homo sapiens 


Human gene 92-encoded protein 


1 1 J 


AO 

yz 








iragmeni, oni^ ils iNL^.'+yo. 






1 /ox 


yooj /y 


— _ j 

Homo sapiens 


Human gene 92-encoded protein 


1 1 A 
110 


O 1 

81 








rragmenx, oxs^ li-' lNLJ.Hyo. 






1 7£0 
1 /OZ 


TTAQA7A 


~im 1 

JVIus musculus 


collagen pro- alpha- 1 type I chain 


1 AC 

1U3 


1A 
34 




uu^ /yu 


Homo sapiens - 


Human secreted protein, SEQ ID NO: 


OA 

yu 


43 








/Of I. 






1 764 




riomo sapiens 


puiauve pi^u 




3D 


1765 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


116 


45 








SEQ ID NO: 163. 






1766 


X02873 


Daucus carota 


put. precursor 


112 


47 


1767 


X92485 


Plasmodium 


pval 


100 


45 






vivax 








1768 


R95913 


Homo sapiens 


Neural thread protein. 


96 


65 



175 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


T7AO 


nnr\6^7 
uuuoj / 


xiomo sapiens 


xiuman secretea protem, otv,; iu inlj. 
47 1 R 


114 


51 


1770 


X97675 


Homo sapiens 


plakophilin 2b 


115 


70 


1771 


000637 

vjUuvJj / 


T-Tr»mri vii f*y~\ q 
jnuiuu £>apiciJo 


XxUiJlaJi SCU1 CICU. piULCJJl, kJXJi\f lJ-J J.NU. 

4718. 


1 ^1 
i i 




1772 


G02532 


Haiti a ^arvfpri*? 


Hum an < ?p-rTf k tp , H nrnf Rin SFO JY\ TsTO ■ 
j-xuj.ixaii ocu cicu pi utt/iJij jll^ 

6613. 




OJ 


1773 




Wattia Q?)r>i^n <? 

XxUlilU OilpiCili 


PR025 50 


l JO 


Oy 


1774 




A iliirAAArta 

melanoleuca 


p vtonVi mm ^ K 

L-Jf LUWJl U111C u 


706 


7R 
/o 


1775 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


98 


53 


1776 
i / /u 


AF1 1671 S 


JuLUiiiU sapiens 






O / 


mi 


AC008054 


Leishmania 
major 


L8453.1 


114 


28 


177R 




numu sapiens 


UoiaiJcupiaMlU liCUlUlJai alJLJgeil ivl/\ 1 


^07 


fj 


1770 
i / ty 




numu sapiens 


jnuuiaii j.iic>LiuL«yte ocuiclcu laCLur nor. 


1 


OU 


17&0 


on?dR0 


JTj.UJ.HL> sapiens 


jTLUHiaii seureieu proieuij oJcv^ iu inu. 
6^61 


1 Oo 


A/1 
04 


1781 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 

Dlillllai LU pUgU CICiJieilL 


80 


36 


1782 


AF174482 


Homo sapiens 


polycomb 3 


133 


46 


17£^ 




Jrtoino Sapiens 


puiaiive pi jU 


lyO 


/U 


1784 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


59 


1 / OJ 




xiomo sapiens 


XT A n 1 ^ 


1 14 




1786 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


120 


44 


1787 


AF200187 


cercopithicine 
herpesvirus 15 


EBNA2-like protein 


108 


30 


I / oo 




nomo sapiens 


-— — 

Human secretea protein, SEQ IE) NOi 

6953. 


Q7 

y / 


JO 


17&Q 

i / oy 




JxallUS 

norvegicus 


salivary proime-nca protein 


1Z5 


/l A 


17QH 




jr iasm o en um 

viva\ 


pval 


AO 

yo 


51 


1791 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


114 


34 


17G7 


U477 / J> 


rioiiio Sapiens 


i^Jtvr l j ivi jCrJtvj / puxanve uansposase 
similar to pogo element 






1 TQQ 

l /yj 


uUUjzo 


ri 01110 sapiens 


xiuman secretea protein;, ott^ iu jncj. 
4400 


~\ AS. 
14D 


71 


1794 


AF104923 


Homo sapiens 


putative transcription factor 


142 


59 






numu sapiens 


xlUXIlall UXwabL wallOcl rcidLCCl prOLCin 

BCRB2. 




ou 


1706 




.rVI aUlUAjpsiis 
mciiicHia 




J / 


oz 


1797 


M22332 


Homo sapiens 


unknown protein 


118 


29 


1708 


V07603 

I U^£,U.7_J 


xi.uij.iu sapiens 


jiiuiiiaii seureicu pioucin encoueu uy 
crpnp 44 Hrmp T4T7~)AT^>99 


1 Art 

luu 


/ 1 


1799 

i / yy 


AT 3901 14 


T fMtihm ania 
j-jeisiiiiiaiua 

major 


Pvtrf^tTlf^lv fvctp'irK^Amlinf* rtr*h rtrntp i n 
cau ciiiwi_y si-ciiic/ vauuc iiL/ii piULcjii 


1 S4 


n7 


1800 


U93570 


Homo sapiens 


p40 


103 


56 


1801 


X99452 


L^y i^upci siv^un 

esculentum 


fSY'tp'Tl CtTl— lll/"*^ r-\v/-\+o 1 Tl T^1"F^4 
CALCllolll-lJJvC piULClil J->'li_> £ + 


IU 1 




1802 


L27428 


Homo sapiens 


reverse rranscr intake 


102 


34 


1803 


L27428 


Homo sapiens 


reverse transcriptase 


141 


43 


1804 


Ml 8933 


Mus museums 


alpha- 1 type-Ill collagen precursor 


118 


30 


1805 


X92485 


Plasmodium 
vivax 


pval 


106 


67 


1806 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


118 


65 



176 



WO 01/64835 



PCT/US01/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1 RH7 


\j\j c -t\jy l 




iAUuiaii secrexeu proxein, jc-y iu inc 
8172. 


1 Uj 


1 1 


1 ROR 




T—ff\ry\ /"> CF>rMPf)C 
n.ciiiiv/ acipiciib 


piaivupilillil ZD 


1 S4 


o 1 


1809 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


130 


53 


i qi n 

loll) 




JriaSmoaiuni 
VI V 3.X 


pval 




D4 


1811 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7RRR 


121 


66 


1812 


Y17833 


Human 

CilUU^CU.UUo 


env protein 


119 


81 


1813 


AF119851 


Homo sapiens 


PRO 1722 


130 


58 


1 R1 A 


AJJJO 1 


XVd.ll Lib 

11U1 VCgiV/UO 




1 ^R 


jU 


1815 


G03473 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7554 


111 


74 


1 R1£ 


A F0 1 ft 1 44 


jnlUIIlU oopiClla 


neurondj cnreaa proxein /c-in i Jr 




Do 


1817 


M19155 


Plasmodium 

IdlCipcli U U I 


S-antigen precursor 


164 


50 


1 O 1 o 


AC] i ROR? 

Jr\x. 1 1 OvOz. 


JULUillvJ JCipiCHo 


pinion? 




75 


1819 


W40353 


Homo sapiens 


Human unspecified protein from 


110 


52 


1 OZ.U 


T 10^5^ 


OUIliU isdpiCllb 


TMi+dtix/P* -r\ 1 ^O 
pULdLlVC piJV 


i i^f 


i < 


1 R91 


T T41 51R 

U^t 1 JJO 


^dCnOillaDUlLlS 

elegans 


proline ncn 


yj 






A VC\ 1 0 1 44 


riAJlIlO bdpicllb 


ncurond.1 xnreaa proxein fsxj / c-rs i jr 


1 <ZA 




1 R9'* 


W4R151 


T-Fr»mn cnnipnc 
n u i 1 1 v J ijdpiciia 


T— iiimcjn nrpciot f*cittf>pr volci+ciri nrAtoin 
XlUilldli UICd^L L,aliL»CI IcIaLCta proiclll 

BCRB2. 


1j 1 


5^ 


1824 


AF130089 


Homo sapiens 


PRO2550 


128 


40 


1825 


AF090944 


Homo sapiens 


PRO0663 


103 


45 


loZo 


A PAAI 1 1 5 


Arabidopsis 

Lllallallcl 


rZ4L/l. 1 o 


1U / 


40 


1 R97 


AF1 Q4517 


hatha OdAiAnc 

noino sapiens 


IN-rVvJ X J 




OS 

Zo 


1828 


AF009668 


multiple 
sclerosis 

retrovirus 


polyprotein 


185 


41 


1829 


AF01 fiftQQ 


IV JL LI 3 HXUloLv LX1 Ho> 


CllUlJllULL/lCdbC/ ICV Cl 3C U O.L li>L- J lpLdbC 


Ijj 




1 R30 






iyciiiciu.jj.ic i cL-cptui i 


51 £ 


R^ 


i R3 1 




Motrin QanipriQ 


t-Tnmjin cprrpfpH "nrntpin sPH TT^i T\TO* 

jn.Ulllclll aCtlCLCU pi ULCIII, OJjy kLJ INC. 

4535. 


10R 

luO 


40 


1832 


U88966 


Homo ^anien^ 

l J.\J111\J (jUL/ivlU 




434 


RQ 


1833 


M19155 


Plasmodium 
falciparum 


S-antiffen "orecur^or 


105 


32 


1834 


AF085809 


lVTu c ; itiurcuIxi^ 


^vnan^in TH 

ay Jitiu Jill J.*y 


OR 

yo 


11 
jj 


1835 


AK023003 


Homo sapiens 


unnamed rvrotfiin nrodnnt 


393 


81 


1836 


Y41740 


Homo sapiens 


Human PRO701 protein sequence. 


429 


78 


1837 


M36913 


Zea mays 


cell wall protein (put.); putative 


72 


35 


1 R1R 




ivJLus inusouius 


proline-rich protein 


QQ 


A(\ 
4U 






numdn 
herpesvirus 6 


TTR2 
Uoo 


1 /lO 




1840 


AF1 34304 


T-fomo Q^nipnQ 


Scar2 


R7 
o / 


17 


1841 


AC024772 


Caenorhabditis 
elegans 


contains similarity to Mus musculus ' 
alpha-NAC, muscle-specific form 
(GB:U48363) 


131 


25 


1842 


AB0023 66 


Homo sapiens 


KIAA0368 


153 


75 


1843 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


140 


47 



177 



WO 01/64835 



PCT/USO 1/04927 



SEQ 10 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1 Rdd 

l O t T t T 


AT 07SS76 


A ratal rJrvr> ci c 

thaliana 


cAiciiMir-iiKe proiciii 




71 
OD 


1 RdS 

1 OH^J 




I IvHIlvJ £>dpidlo 




01 
y i 


7/1 


1846 


MI 4228 


Gallus gallus 


c-beta-3 beta-tubulin 


598 


83 


1 %A7 


AT<Tfi77717 


rioino sapienb 


unndmea pr oxem prouuci 


07 


3o 






riuiiio sapieiib 


jtvauo KjUi/Kj i jr exc/nange lautor 

fi run nl r\ oi i 


1 7/1 
1 /4 




1849 


W23949 


Homo sapiens 


Human phosphoinositide 30H-kinase 
pi 01 subunit. 


143 


28 


1 R^n 


VJU O J OO 


riomu sapiens 


riuman secrexea proxein, oiiv^ iu invj. 

7R67 


1 A1 
IU I 


03 


1 R^l 

1 o-> 1 


AR01 7114 


xxuiliu bapicilo 


AD ^ 


1 1 j 


xUU 




nnos7n 


lYIUo I11U£>L,UIU£> 




^ 7/f 


3D 


1 R^7 


r\\J / J l v 


\At1C WllC/^IlllIC 


CONTAINING N ALLELE OF FV1 


ID J 


A'J 
4Z 


1 RS4 


TT4Q07A 


n.01110 acipidisj 


1 11 di 111 CI LI all bp \Jh clb c 


7 1 A 


5Z 


1 R^S 


TTQ7^6Q 

U7JJD" 


xauinu oapiciio 


rvlO 


yj 


^ 1 


1856 


D89729 


Homo sapiens 


CRM1 protein 


475 


90 


1 R^7 


AFOQfiRQs 


jntomo sapiens 


PT? OH 117 


on 

oy 




1858 


AF0 15926 


Homo sapiens 


ezrin-radixin-moesin binding 
ph osph opr o te in-5 0 


117 


73 


1859 


D13721 


Gallus gallus 


NF-kB p65 subunit 


223 


56 




Js.lozU4 


Homo sapiens 


salivary pro line-rich protein precursor 


1 1 1 


32 




W oooz / 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


195 


52 




T T/tQQ77 

U4yy / j 


Homo sapiens 


uivr i , JviiiiO / , putative transposase 
similar to pogo element 


O 1 1 


69 


1 R*^7 




Homo sapiens 


Human secreted protein, SEQ ID NO*. 
8172. 


o rv 

oU 


i j 


1&04 


T 77/17 R 


Homo sapiens 


reverse transcriptase 


O/l 


c 1 
51 


1865 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


106 


46 


1866 


U53585 


Mycobacterium 
avium 


fibronectin attachment protein 


86 


36 


1867 


AF255446 


Crypthecodinium 
cohnii 


Dip 1 -associated protein C 


134 


45 






Homo sapiens 


-— ; 

Human secreted protein, SEQ ID NO: 

OOlJ. 




63 


1 R6Q 




P \ ftt\ r* r\ r* r*n c 
r y i ULUbL Lia 


/jjad lung iiypuuicLiudi pruLcin. 


1 OR 


/lO 
4U 


1870 


M13100 


Rattus 


unknown protein 


121 


53 


1871 


X67863 


lVfii*; mn^ftiiliiQ 


T2 


101 

I V 1 


7^ 


1872 


S80119 


Rattus sp. 


reverse transcriptase homolog 


151 


43 


1 R77 


VV / J \}JJ 


XiAJlIiU aaplCllo 


HHmiin c?^r*f<3f"^»/i Mi"<rivaifi ] f\ti £± 
JTLUIIiall bccicicu piULClIl L<1UI1C. 


t*+u 


A A 


1874 


U57053 


Homo sapiens 


myosin-ID 


203 


82 


I & /D 




riomo SapicUo 


riuiiian oreast cancer related protein 
BCRB2. 




_)4 


1 R7^ 


A/T7 1 007 


Homo sapiens 


CD 19 differentiation antigen 


/I 70 


/y 


1 R77 
1 o / / 


dC\dCiQ 1 


riomo sapiens 


Human secreted protein, SEQ ID NO: 

R1 77 

OX I jL. 






1878 


U25281 


Rattus 
norvegicus 


QrJ'J Haiti sin h i n H i ti cr rifnt^in 

iJLXJ U\JllXdill UlllLiXllg pXVJLClii 


10R 


76 


1879 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


117 


48 


1880 


X73113 


Homo sapiens 


fast MyBP-C 


599 


77 


1881 


AX028128 


Homo sapiens 


unnamed protein product 


162 


43 


1882 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


138 


62 



178 



WO 01/64835 



PCT/US01/04927 



SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 








7870 
/ 0 / O. 






iOCO 


1 UUO04 


— — ; 

Homo sapiens 


open reading frame 1 (AA 1-86) 




34 


1 88A 


T 18^87 
UoOjo / 


IVlus musculus 


phosphatidylinositol 3 -kinase catalytic 


OA/1 
xU4 


11 








suuuniL p 1 1 u aena 






1 8RS 


AU / JJ / 


-TIOIIIO SapiCIlS 


ixurnan pre - in isj. > / v cieavage lacior i oo 


1 

1 HO 


in 








x\J~/CL oULIUIllL 






1886 


Y13829 


Aalalaiu oapiciio 


1V1X>1NJ_/ JpiUlClll 


1 1 7 


Oj 


1 RR7 


GOIQ^I 


1-Trvmr* c c» t^v t f» n q 
XaLHUO odJJICllo 


jmuiilail ocUClCU piUldll, OJ_/V^ VLJ l\\J . 


I 1 0 

I I y 


8A 








6012. 






1 888 


PQ^Ol ^ 


xiomo sapiens 


Neural thread protein. 


Ol 

y i 


04 


1 88Q 


t ^onso 
jL»^?yu->y 


rioino Sapiens 


trans cripnon racror i 




O /J 


1 8qa 


ivioyzy / 


Homo sapiens 




loy 


4o 


1 8Q 1 


pncn 1 1 

Kyoy 1 j> 


Homo sapiens 


Neural thread protein. 


91 


/TO 

62 


1 8QO 


T'OQOAI 


S acch aromy c e s 


UKr YJSJLZU2W 


o^ 

yj 


c 1 
51 






cerevisiae 










riAA/107 

OUU4Z / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


90 


51 








A < AQ 






1 80,4 


T Ld.007'} 

U4yy 1 j 


xiomo sapiens 


vjrvr 1 3 i\dJc/ixj / , putative transposase 


1 OD 


15 








similar to pogo element 








VI AAf C 


Homo sapiens 


phosphoinositide 3-kinase 


564 


84 


1 8QA 




ivius muscuius 


aipna-acimin J 


J / J 


o< 

OJ 


1 SO'? 
1 07 / 


riA'3907 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


oZ 


OO 








7878 






1 8Q8 
1070 


A/fo 1 ood 
iviZr 1 y U4 


-— = : 

Homo sapiens 


4F2 heavy chain antigen 




OA 


1 8QO 


AT*098007 




TCI A A 1 C\7 &. TM*rttp>in 
iviiA/\ lu/f protein 




^> Q 
JO 


1 onn 
iyuu 


uy /0 j3 


murid 


unknown 


9o 


41 






iicipcsvii US *+ 








1 on 1 
iyu 1 


A "DAI 1 1 /tO 

AJ3U1 1 14z 


Homo sapiens 


VT A A ACTA _ ' 

K1AA05 /U protein 


209 


95 


1 QA9 
iyuz 


A T? 1 Q/l ^ ^ 7 
rvr iy43 O 1 


Homo sapiens 


XT A n 1 ^ 


loi 


rr 


1 om 
iyuj 


Uy3Z>04 


Homo sapiens 


putative pi 50 


I4z 


33 


1 QA/1 
iyU4 


ivizyjyy 


Homo sapiens 


erythrocyte membrane protein band 4.2 


413 


AA 

90 




V99AAA 


Homo sapiens 


Human P450 reductase functional 


zy4 


0/ 








fragment sequence. 










iiomo sapiens 


rivjr Jrio riuman vjrowtn r actor 


10/ 


1 ah 








Homologue 3. 






1 007 


ATT 1 98^9^ 


— — : 

Homo sapiens 


CDC42 -binding protein kinase beta 


can 


©y 


1 008 


A T30901 d7 


Cucumis sativus 


expressed in cucumber hypocotyls 


yo 


/i 1 

41 


iyuy 


"VI ■3707 


Homo sapiens 


alpna-l type z collagen / 14 AA) 


87 


4j> 


1 01 a 
iy iu 


A A/^1 A 


Mus musculus 


1Q motit containing u 1 rase activating 


^jO 


48 








protein 1 






iy 1 1 


A T719QA9^ 


tt ; 

Homo sapiens 


1 -UUMJrl-rri A rKU 1 .biiN 1 , 1 Jcih 1 A 


Dl 1 


AA 

90 








<iT fRT r\nx ("tc*t> i TprrnTA^ 
oui5UiNH ^a - i-i riJti 1 ±\) 






1019 




ivius musc/Uius 


type yv v collagen 


87 


OQ 
Zo 




I 144oz 


Homo sapiens 


Fragment of human secreted protein 


124 


75 








encoaeu oy gene i /. 






1 014 


AF1719^0 


__ 

V i§na 


pnospnatiuic acia pnospnatase oeia 


1 ±2. 








nn en 101 il fitti 
u 1 1 g u x V/ lii a ia 








1 Q1 ^ 

1.27 L~J 


AF01 6000 


A/flic miicr*nlno 
IVlLlis 111 US L< 111 US 


eiiuoiiucieasey reverse tTaiiscripid.se 


1 AA 
1 UU 


4 / 


1 Q1 £ 

iy 10 


lYiy 4 1 3 I 


Homo sapiens 


mucin 


y / 


3 / 


ion 


Y1 ^88^ 


Nicotiana 


extensm (aa i-ozuj 




34 






tabacum 








iy io 


a T?t <?<c/ca<: 
Ar 1 oooUj 


Homo sapiens 


JVIL.L.Z protein 


1 15 


29 


1 Q10 

1 -7 A 27 


M191 10 


11 0 NIC 

lYJLUS ILlUbUlilUb 


i\j.N/\ polymerase n 


AO 8 

4y o 


OJ 


1920 


AL049794 


Homo sapiens 


dJ777L9.1 (novel protein similar to 


514 


90 








mouse kinesin-like proteins KIF1A and 












KIF1B) 






1921 


D83703 


Homo sapiens 


peroxisome assembly factor-2 


233 


64 


1922 


Y11922 


Homo sapiens 


Human 5* EST secreted protein SEQ ID 


162 


77 








No: 522. 







179 



WO 01/64835 



PCT/US01/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


i ot; 




Homo sapiens 


similar to nitrogen permease regulator; 
similar to Jrjyyzj ^riu.g/jui /uj, 
match to AA233630 (NID:gl 856833) 

and /\r\jyy*t\JA \iS WJ D l^f / ) 




79 


1924 


D82060 


Homo sapiens 


membrane protein with histidine rich 
charge clusters 


115 


40 


1 07 S 
1 y A, J 


TTAQQ7A 
XJ^yy 1 H 


nomo sapiens 


m driiier transposase 


1 QS 

1 yD 


0 1 


1926 


S80119 


Rattus sp. 


reverse transcriptase homolog 


104 


40 


1 QO'7 




Homo sapiens 


Human secreted protein, SEQ ID NOi 


ty 


Do 


1 07£ 


Q7Q<<^Q 

o /yojy 


iiomo sapiens 


ha i i^^uraiive rumour 

s upprci>t> or/ ii ere u i idry iiiiiii ip i c 

CAUalUaCa CallUlUciLC gCllC 




520 
OO 


1929 


X58063 


Brugia pahangi . 


major protein component of the 

llllwUllKUlcxi ailCdLil 


104 


43 


1930 


AF1 1 08SS 


T-Trvmr\ ccini p'ti c 

I 1 kJI IIKJ JsdJL/lCIlo 


PRO! R&7 
x ss\y i o*+ / 






103 t 




I XUIllvJ adplCllo 


o-pnoapiioiructo-Zi*-i\.iiiase 


^7^ 




1932 


AF020261 


Santalum album 


proline rich protein 


93 


33 


1Q33 
iyj J 




Homo sapiens 


hypothetical protein 


oZi 


OZ 


1 Q3A 


AT A*\1 HI 7 


in euro sp or a 

CLaSSa 


reiaiea lo ui oivi/\.LAj iNUv^jjrs/\ix 

RTRn*KTTTPT T^nPRnTFTTJ P 
INJLIjwIN kj ^LjSJjkJx J\U 1 J_/liN 


1 1 

1 lo 




1935 


A61971 


unidentified 


MCSP 


328 


79 






Homo sapiens 


putative pi 50 


z 1 / 


ZLA 


1937 


U97553 


murid 

herpesvirus 4 


unknown 


113 


38 


1938 


AF 194537 


Homo sapiens 


NAG 13 


90 


37 






Homo sapiens 




a m 
4l>3 


0 c 


1 qa n 


V£7./t 1 3 


Human 

herpesvirus 6 


UiSo 




en 


1 OA 1 


AJUU /OZa 


Rattus 
norvegicus 


hjljv cnannei i 


1 i 0 
1 12 


3o 


1 0A7 




/vraDiaopsis 

UlcxiJailci 


cen wdii-piasmd. memDrane unKer 




3D 


1943 


AF 1 3 0089 


Homo sapiens 


PRO2550 


93 


63 


1 OAA 




xiDiiio sapiens 


Jtiuman secreieu proieinj oxiv,/ inu. 
771 A 


1 A7 
1 VZ 


7A 


1945 


Ml 0410 


A/Tnc miicniliiQ 




171 


AO 


1046 


TT^73 1 6 


JTHJllILJ octpiCilo 


lllbLUllC dUCLyiLi dllblCl cloC 


1 


7^ 

/ J> 


1947 


at i633o? 


T-I nm r\ onni ad c 

XJ.VJI11U OdJJldlo 


lxuillctll Lypc S\. V 111 -Lr^llClgdl 


70 








r\S dUlUUp&Io 

thaliana 

W. X till CU 1 CI 




1 1 7 
11/ 




1949 


AJ011738 


Homo sarviens 


Inilb 


209 


85 


1950 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


155 


48 


1951 


AF1 1 9S69 


T-Trvmri cpt"»i 


nut r-fif-^H 7 


1 66 




1952 


AB002107 

•TT- U \J\J V\J 1 




hPpr 

11JT CI 


1 1 R 


30 

Dy 


1953 


X98834 


T-Trvmn ^ianif*n^ 


y\v\c fin opr rvrofrpin T-Tq?i17 

Zilll-U Allied piL'LCill JLlSdlZ, 


430 


70 


1954 


Y19641 


nuinu sauiciio 


SFO TO 1MO ^SQ from WOQQ9?74^ 




ftd 

VJ T 


1955 


U93569 


TTom o 5anipn<! 


nutatiVF 1 n 1 SO 

JJU.ldLlVC' pi JU 


1 57 


44 


1956 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


85 


53 


1957 


AB013729 


Mus musculus 


semaphorin Y 


106 


38 . 


i y j o 




nuino bctpieiis 


FA A 




Do 


1959 






1 XNJ.JT pi VJ LC1I1 


SOS 


0 s ? 
yj 


1960 


AC004022 


Homo sapiens 


serum paraoxonasearylesterase 3 


147 


62 


1961 


AF076776 


Drosophila 
melanogaster 


helicase DOMINO A 


157 


45 


1962 


AF265555 


Homo sapiens 


ubiquitin-conjugating BIR-domain 
enzyme APOLLON 


587 


77 


1963 


AF229642 


Mus musculus 


DXImx46e protein 


127 


91 



180 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


1964 


U72520 


Mus musculus 


mena protein 


89 


31 


1965 


W40j09 


Homo sapiens 


Human ITAK protein. 


179 


32 


1966 


AJ388557 


Canis familiaris 


zinc finger protein 


826 


56 


1967 


Y92515 


Homo sapiens 


Human OXRE-12. 


224 


53 


1968 


Y17832 


Human 
endogenous 
retrovirus K 


pol protein 


187 


49 


1969 


Y41245 


Homo sapiens 


Human Y218 protein. 


220 


78 1 


1970 


AB052738 


Sus scrofa 


Smad3 


366 


85 


1971 


AB007644 


Arabidopsis 
thaliana 


contains similarity to phytocyanin/early 
nodulm-hke protein~genejd:Kl 9P 1 7.3 


1 06 


32 


1972 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A 1 A 

4414. 


122 


57 


1973 


AT 1 CT AT~t 

AL3 57472 


Homo sapiens 


VFS3jB 


1 12 


31 


1974 


Ar 151 902 


Homo sapiens 


CCiI-144 protein 


1 12 


95 


1975 


ATI'? T"> /fA 

ALT 3 7260 


Homo sapiens 


hypothetical protein 


148 


92 


1976 


U89505 


Homo sapiens 


HIark 


408 


89 


1977 


U67328 


Mus musculus 


NIPI-like protein 


168 


73 


1978 


AK026435 


Homo sapiens 


unnamed protein product 


601 


94 


1979 


Y14318 


Homo sapiens 


peroxisomal ABC-transporter 


507 


96 


1980 


U63630 


Homo sapiens 


MCM4 


570 


90 


1981 


AF1 18090 


Homo sapiens 


PRO2044 


154 


84 


1982 


AF0 16370 


Homo sapiens 


U4/U6 small nuclear ribonucleoprotein 
hPrp3 


422 


63 


1983 


AB011154 


Homo sapiens 


KIAA0582 protein 


420 


80 


1984 


ABO 11422 


Homo sapiens 


Trad 


201 


67 


1985 


AL390156 


Homo sapiens 


hypothetical protein 


230 


97 


1986 


AJ242540 


Volvox carter i f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


163 


44 


1987 


M12523 


Homo sapiens 


alloalbumin Venezia 


411 


91 


1988 


G02996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7077. 


145 


63 


1989 


Y36156 


Homo sapiens 


Human secreted protein #28. 


107 


80 


1990 


AF161356 


Homo sapiens 


HSPC093 


156 


57 


1991 


G03443 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7524. 


132 


72 


1992 


AF119851 


Homo sapiens 


PRO 1722 


126 


52 


1993 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


228 


60 


1994 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


113 


40 


1995 


G02469 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6550. 


118 


58 


1996 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


137 


57 


1997 


X75068 


Bos taurus 


plasmalemmal porin 


85 


85 


1998 


Y64869 


Homo sapiens 


Human 5' EST related polypeptide SEQ 
ID NO: 1030. 


92 


80 


1999 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


275 


79 


2000 


AF1 18078 


Homo sapiens 


PRO 1 848 


1 1 1 


57 


2001 


S80119 


Rattus sp. 


reverse transcriptase homolog 


120 


61 




A t>A1 1 1 1 ft 


Homo sapiens 


jsj/\/\ioj> o protein 


J4o 


to 


2003 


M15530 


Homo sapiens 


B-cell growth factor 


94 


64 


2004 


AF225918 


Mus musculus 


intestinal cell kinase 


216 


77 


2005 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


125 


85 


2006 


D84391 


Mus musculus 


reverse transcriptase 


135 


38 


2007 


AF090930 


Homo sapiens 


PRO0478 


101 


86 



181 
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SEQ ID 

NO: 


Accession No. 


Species 
. 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


zUUo 


UUjoU / 


Homo sapiens 


xiuman secreted protein, oiiv,/ iu invj. 

7£RR 


1 C/C 

1 DO 


66 


2009 


Y02671 


Homo sapiens 


Human secreted protein encoded by 

opnf» 77 HnnP R 

gene zz cione riivioj wio. 


165 


49 


2010 


AF1 13685 


Homo sapiens 


PRO0974 


124 


63 


901 1 


/-\.JT IjuW / 7 


noiiiu sapiens 




1 A\ 


70 
/ 0 


2012 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


94 


51 




A T7AQAQ/M 


Homo sapiens 


rKvJUooo 


0 / 


53 


ZUi4 


Y UoUOZ 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA35638. 


151 


CO 

58 


o a i ^ 
ZU ID 


> AT700 

OUJ fy I 


Homo sapiens 


Human secreted protein, SbQ ID NO: 
7878. 


ooo 

223 


75 


ZU 10 


ao yioc 


Homo sapiens 


Human secreted protein, SBQ ID NO: 
6566. 


1 1 T 

137 


78 


zUl / 


JLzoyji 


Homo sapiens 


chromosomal protein 


120 


74 


OA 1 O 

zuio 


CjuJ64o 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7727. 


68 


57 






Homo sapiens 


Human secreted protein, SEQ ID NO: 

'7707 

/ /z /. 


64 


42 


2020 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


334 


63 


2021 


L26953 


Homo sapiens 


chromosomal protein 


112 


64 


oaoo 


W4&J31 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


139 


69 


zUzj 


T TA-3 CCA 

U9^5o9 


Homo sapiens 


putative pi 50 


187 


89 


2024 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


80 


72 


2025 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 
bEQ ID NO: 163. 


142 


71 


oao/c 
zUzo 


VO/C 1 AO 

I /oi9o 


1 

Homo sapiens 


Human secreted protein encoded by 
gene 75. 


77 


66 


zuz / 


A TO/IO ^/l A 

AJZ4ZD4U 


Vol vox carter i f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


1 TO 


73 


2028 


U49973 


Homo sapiens 


ORFi; MER37; putative transposase 
similar to pogo element 


159 


80 


2029 


AF 194537 


Homo sapiens 


NAG13 


158 


57 


2030 


L26251 


Trypanosoma 
brucei 


CR5 


110 


41 


2031 


AF197913 


Helicoverpa 
armigera nuclear 
polyhedrosis 
virus 


basic DNA-binding protein BDBP 


149 


55 


2032 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


115 


80 


2033 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/ Jyj. 


157 


82 


1CYXA 


/71AO/1 AO 


— ; 

Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 




75 


oa^ ^ 


UrUj)O40 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7727. 


1 A/t 

1U4 


cn 
Of 


ztoo 


urUUJzo 


Homo sapiens 


Human secreted protein, bEQ ID NO: 
4409. 


137 


71 




AF1 ^OftSQ 


tlUlllU bapiGIlci 




1 AA 


J3 


2038 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


81 


73 


2039 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


122 


66 


2040 


U49973 


Homo sapiens 


ORFI; MER37; putative transposase 
similar to pogo element 


159 


68 



182 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2041 


BO 1372 


Homo sapiens 


Neuron-associated protein. 


95 


79 




voi ^s 4 ; 


truman 
herpesvirus 4 


£> i ivr i, encoues JbiJiN/i-z (^amoaugn 
et al, 1 984; Dillner et al 5 1 984) 


1 Uo 


I I 


9043 






riuiiiaii secrcictj proiem, m>i-,K£ i.l,' inva 
6596. 


1 95 


09 








px? n 1 009 


1 0^ 
1 UJ> 




2045 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OO 1 J . 


146 


64 


2046 


W50192 


Homo sapiens 


Amino acid sequence of salivary 

r>rrvh=Mn fOXT 1 
prutem v^UlN - 1 . 


108 


51 


2047 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
aenveu xrom l/in/\jdojo. 


133 


68 


2048 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
jl>tn/\, compxexe virion genome 


155 


64 


90zlQ 
zu^y 


T L1QQ73 


jnonio sapiens 


wivr 1 3 ivijdjco / , puiauve uransposase 
similar to pogo element 




DD 


ZUju 


/vLjyi/i 14 


Leishmania 
major 


extremely cysteine/valine rich protein 


1 IZ 


56 


oo^ 1 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


/U 


o2 


90^9 


V£1 OA*I 


i lyQla sp. 


mini- collagen 




4o 


2053 


X83413 


Human 
herpesvirus 6 


U88 


148 


73 


ZUj4 


009009 


riomo sapiens 


— — ~ ; -— 

Human secreted protein, SEQ ID NO: 

D70J. 


/-> 




905^ 


009539 


JnHJIlHJ ba.piCI15> 


rTunid.n seurexeQ protein, oriv^i yyj \>\\j. 
6613. 






9056 


X76594 

VY 1 UJZt 


JL/ 1 \j Ly U S LC 1 i LI 1 1 1 

discoideum 


paHiVkt re>crifxrt { A A J ^_ /f Q 7^ 
L-UUlllg, rc^lOIl ^^V/\ 1 -4-3 / ^ 


7 Ozl 


D 1 


9057 


RQ5913 




"NlRtirnl +V»T"f i ?»H nrntpin 
1NCUI al llllCa.Ll piULCLLl. 


OJ 


/ / 


2058 


AF1 18080 


Homo sapiens 


PRO1880 


126 


61 


9050 


A T996003 


rioino Sapiens 


vj/iivi proccin 


l JO 




9060 


OUZ453 


riomo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


ion 
izO 


/u 


ZUOi 


YH 1 0 1 5? 
AUl7l 5 


Drosophila 
melanogaster 


salivary gland glue protein 


1 HC\ 

I ly 


/i a 

4L) 


90A9 


TT00090 
UUUUZ7 


oaccnarornyces 

LOl C V iolclC 


i nrz i / cp 


1 1 A 




2063 


AF130051 


Homo sapiens 


PRO0898 


143 


80 


90£A 
ZU04 


afoo^oai 


riomo sapiens 


placental growth hormone isoform 

1JVJXT"* V _> 


1 O 1 


Of 


906S 


O0940Q 


rxuiiio sapiens 


i Luiiian scL/reicu protein, ojdv</ ijl/ inu. 
6490. 


1 90 

izy 


OO 


2066 






UliiVJll V/ Wll 


09 


05 


9067 


A T9 49 540 


V U1VUA \wCX\ LCJL I i.. 


iiyuiuAypiuiiiie-riuii giyuoprotein J-Jz^,- 
HRGP 


>>05 


/y 


2068 


L27428 


Homo sapiens 


reverse transcriptase 


186 


58 


2069 




T-Trvm n Qf»r\i pn q 




1 64 


/ D 


2070 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


271 


73 


2071 


AF 130051 


Homo sapiens 


PRO0898 


110 j 


56 


9079 


AF01 01 44 


raorno sapiens 


neuronal uireaa protem f c-jn i Jt 


I i / 


0/ 


2073 


AF1 57706 


T-Tnm an 

A X Will Gill 

herpesvirus 6B 


JJ'r 


1 OA 


AO 


2074 


AL049608 


Arabidopsis 
thaliana 


extensin-like protein 


120 


41 


2075 


AP002460 


Arabidopsis 
thaliana 


gene_id:FlD9.26~unknown protein 


150 


27 


2076 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


130 


85 



183 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2077 


a 1 1 o /"V— r 

Ml 1897 


Mus musculus 


proline-rich salivary protein 


125 


42 


2078 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


66 


75 


2079 


Ar 041330 


Bodo saltans 


NADH dehydrogenase subumt 5 


185 


47 


2080 


AJ006470 


Homo sapiens 


cartilage-associated protein (CASP) 


139 


84 


2081 


AK024509 


Homo sapiens 


unnamed protein product 


132 


83 


2082 


U93564 


Homo sapiens 


putative pi 50 


137 


67 


2083 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


99 


71 


2084 


B01372 


Homo sapiens 


Neuron-associated protein. 


148 


81 


2085 


AL160371 


Leishmania 
major 


probable (hhv-6) u 1 1 02, variant a 
DNA, complete virion genome 


114 


76 


2086 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


255 


63 


2087 


AF 194537 


Homo sapiens 


NAG 13 


122 


78 


2088 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


108 


74 


2089 


AF 13 0079 


Homo sapiens 


PR02852 


149 


77 


2090 


U93574 


Homo sapiens 


p40 


260 


96 


2091 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


99 


76 


2092 


AF 194537 


Homo sapiens 


NAG 13 


125 


51 


2093 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


319 


72 


2094 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


159 


56 


2095 


U93572 


Homo sapiens 


putative pi 50 


195 


57 


2096 


X83413 


Human 
herpesvirus 6 


U88 


132 


57 


2097 


AF0 10400 


Homo sapiens 


transaldolase-related protein 


463 


89 


2098 


U93563 


Homo sapiens 


putative pi 50 


128 


35 


2099 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


160 


43 


2100 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


133 


43 


2101 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


97 


71 


2102 


K02401 


Homo sapiens 


chorionic somatomammotropin 


628 


90 


2103 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


141 


80 


2104 


U49973 


Homo sapiens 


ORF1 ; MER37; putative transposase 
similar to pogo element 


309 


63 


2105 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


192 


37 


2106 


Y59772 


Homo sapiens 


Human normal ovarian tissue derived 
protein 49. 


261 


89 


2107 


AF202051 


Homo sapiens 


NM23-H8 


680 


100 


2108 


R13556 


Homo sapiens 


Protein encoded downstream of hhcJVI 
oncoprotein. 


127 


54 


2109 


L22029 


Glycine max 


hydroxyproline-rich glycoprotein 


121 


36 


2110 


D26135 


Homo sapiens 


diacylglycerol kinase gamma 


172 


100 


2111 


U49973 


Homo sapiens 


ORFl; MER37; putative transposase 
similar to pogo element 


159 


56 


2112 


X65488 


Homo sapiens 


hnRNP U protein 


117 


70 


2113 


ALj>59782 


ri-t 

Trypanosoma 
brucei 


probable similar to ring~h2 finger 
protein rhala. 


116 


35 


2114 


AF130051 


Homo sapiens 


PRO0898 


93 


62 


2115 


AJ242540 


Volvox carted f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


260 


53 


2116 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 
SEQIDNO:l63. 


148 


70 



184 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


1 Smith- 
Waterman 
Score 


% 

Identit 

y 


2117 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


304 


71 


2118 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


70 


66 


2119 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


118 


84 


2120 


S79410 


Mus museums 


nuclear localization signal binding 
protein 


94 


94 


2121 


T T A AAao 

U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


304 


73 


2122 


MH901 


Rattus 
norvegicus 


proline-rich salivary protein 


83 


32 


2123 


AK025047 


Homo sapiens 


unnamed protein product 


116 


48 


2124 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


95 


72 


2125 


AF138883 


Bos taurus 


type II collogen cyanogen bromide 
fragment CB 10 


103 


40 


2126 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


186 


100 


2127 


G03H4 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


117 


52 


2128 


AF 194537 


Homo sapiens 


NAG 13 


126 


66 


2129 


ABO 15802 


Acetobacter 
xylinus 


similar to cellulose complementing 
protein of A. xylinum ATCC23869 


109 


80 


2130 


AF187147 


Mus museums 


drebrin A 


110 


38 


2131 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


148 


42 


2132 


L36341 


Aspergillus 
nidulans 


regulatory protein 


130 


49 


2133 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


119 


58 


2134 


AF1 16689 


Homo sapiens 


PR02168 


113 


81 


2135 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


191 


38 


2136 


M81321 


Macaca 
fascicularis 


proline-rich protein 


125 


43 


2137 


A18812 


Brassica napus 


extensin 


106 


34 


2138 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


104 


41 


2139 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


191 


92 


2140 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


196 


90 


2141 


Y57285 


Homo sapiens 


Human GPCR protein (HGPRP) 
sequence (clone ID 2214673). 


507 


78 


2142 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyprolme-nch glycoprotein DZ- 
HRGP 


201 


47 


2143 


AF090944 


Homo sapiens 


PRO0663 


164 


53 


2144 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


390 


76 


2145 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


198 


70 


2146 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


119 


82 


Z14/ 


A TO A O C yl A 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


194 


45 


2148 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


91 


60 


2149 


AF090942 


Homo sapiens 


PRO0657 


129 


66 


2150 


AF130089 


Homo sapiens 


PRO2550 


372 


82 


2151 


AC009991 


Arabidopsis 


unknown protein 


81 


59 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






thaliana 








2152 


AF090942 


Homo sapiens 


PRO0657 


93 


53 


2153 


R13319 


Homo sapiens 


Partial Human Natural Killer receptor. 


215 


89 


2154 


AC008075 


Arabidopsis 
thaliana 


¥2415 A 


139 


36 


2155 


AT f\. t\ -\ t A 

AL3901 14 


Leishmania 
major 


extremely cysteine/valine rich protein 


148 


50 


2156 


S79410 


Mus musculus 


nuclear localization signal binding 
protein 


1 12 


58 


2157 


AK024455 


Homo sapiens 


FLJ00047 protein 


152 


66 


2158 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


'85 


80 


2159 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


165 


75 


2160 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


145 


45 


2161 


AF1 19900 


Homo sapiens 


PR02822 


138 


71 


2162 


AJ223953 


Homo sapiens 


hPTTG 


106 


62 


2163 


AK023542 


Homo sapiens 


unnamed protein product 


76 


52 


2164 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


118 


84 


2165 


U41038 


Caenorhabditis 
elegans 


Similar to cadherin-type repeat 


137 


65 


2166 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


348 


88 


2167 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


108 


84 


2168 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


114 


48 


2169 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


125 


67 


2170 


Ar 130089 


Homo sapiens 


PKO2550 


142 


65 


2171 


Y19609 


Homo sapiens 


SEQ ID NO 327 from W09922243. 


103 


65 


2172 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


104 


39 


2173 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


91 


50 


2174 


D38435 


Homo sapiens 


homologue of yeast PMS1 


314 


96 


2175 


AF119851 


Homo sapiens 


PRO 1722 


230 


69 


2176 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


100 


63 


2177 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


137 


44 


2178 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 

/*T\YT1 TTTA1 

(PIR:JH0l62) 


139 


42 


2179 


A T A /t A/? AO 

AL049608 


Arabidopsis 
thaliana 


extensin-like protein 


286 


57 


2180 


L17318 


Rattus 
norvegicus 


proline-rich proteoglycan 


148 


40 


2181 


A T/A1 A A CC 

AK024455 


Homo sapiens 


FLJ00047 protein 


97 


63 


2182 


R95913 


Homo sapiens 


Neural thread protein. 


100 


69 


2183 


Ar 266479 


Homo sapiens 


rectachrome l 


148 


81 


2184 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


64 


70 


2185 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


107 


40 


2186 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


110 


84 



186 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 




A 17 1 "2999^ 
s\ r Yj I J. J J 


Gallus gallus 


aipna 1 (v ) collagen 


103 


43 


2188 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


115 


36 


ti on 


jLjyuuo4 


Homo sapiens 


NCA-W272 


271 


100 


O t OA 


ATI OArtQQ 


Homo sapiens 


T>T> AOCCA 

jtKAJZCOU 


137 


68 


2191 


X65165 


Volvox carter! 


extensin 


113 


62 


2192 


U4997.3 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


154 


59 


2193 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


148 


59 


2194 


U52077 


Homo sapiens 


mariner transposase 


257 


53 


2195 


B12j>10 


Homo sapiens 


T T a. 1 j " til 

Human secreted protein encoded by 
gene 10 clone HDPGP94. 


98 


79 


2196 


D38l 12 


Homo sapiens 


NADH dehydrogenase subunit 5 


180 


77 


2197 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


121 


65 


OI AO 

2198 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3 . 


126 


64 


2199 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
78o5. 


101 


67 


ZzUU 


AJUU/U42 


Homo sapiens 


1 KX5 protein 


264 


75 


2201 


AF130051 


Homo sapiens 


PRO0898 


71 


61 


2202 


AK024455 


Homo sapiens 


FL J 00047 protein 


153 


62 


2203 


U8j>30o 


Homo sapiens 


line-1 reverse transcriptase 


95 


52 


2204 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


128 


82 


2205 


AF090895 


Homo sapiens 


PRO0117 


163 


69 


2206 


G03806 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7887. 


154 


69 


2207 


R95913 


Homo sapiens 


Neural thread protein. 


103 


86 


2208 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


144 


78 


2209 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


100 


65 


2210 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


111 


45 


2211 


AF0909:>1 


Homo sapiens 


PRO0483 


63 


90 


2212 


W 8 8627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


100 


74 


2213 


Y 02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HI DAD22. 


98 


41 


2214 


M76671 


Lycopersicon 
esculentum 


extensin (class II) 


137 


35 


99 1 


AUj / 1 / 


Homo sapiens 


— ; , 

pot. unidentified reading frame 


no 

98 


54 




pn^Q 1 q 
rviO:? 1 j 


Homo sapiens 


Neural thread protein. 


109 


48 


ZZ1 / 


/\r 1 1 0U0O 


Homo sapiens 


JrJKAJ 1 y VZ 


138 


'ICS 

79 


ZZIo 


A T7AQ 1 A QA 


Homo sapiens 


alpha-tubulin isoform 1 


343 


95 


971 Q 

zziy 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


157 


O A 

84 , 


zzzu 


A TT1 A1 "3/19 


Homo sapiens 


alpha-tubulin 


571 


94 


2221 


AF071172 


Homo sapiens 


HERC2 


187 


84 




Ml / /oi 


Homo sapiens 


glia-derived nexin precursor 


529 


83 


2223 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


588 


85 


979/1 
ZZZ^t 


/\is.uzou /z 


Homo sapiens 


unnamed protein product 


199 


57 


2225 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


161 


61 


2226 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


126 


41 


2227 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


124 


63 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2228 


AF 119855 


Homo sapiens 


PRO 1847 


108 


74 


2229 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


122 


55 


2230 


AF090944 


Homo sapiens 


PRO0663 


146 


61 


2231 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


152 


74 


2232 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


80 


59 


2233 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


129 


58 


2234 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


143 


60 


2235 


A. TJ*"" f\f\ A A ^ £* 

AK024455 


Homo sapiens 


FLJ00047 protein 


133 


71 


2236 


Y87103 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 142. 


99 


59 


2237 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


111 


38 


2238 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


119 


63 


2239 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


75 


65 


2240 


L27428 


Homo sapiens 


reverse transcriptase 


136 


40 


2241 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


122 


82 


2242 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


105 


95 


2243 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


79 


48 


2244 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


145 


75 


2245 


U09116 


Homo sapiens 


ORF1, encodes a 40 kDa product 


346 


87 


2246 


U82303 


Homo sapiens 


unknown 


155 


79 


2247 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


105 


45 


2248 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


105 


47 


2249 


WO/TO A O 

Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


161 


63 


2250 


A T?A 1 A 1 /I A 

Ar010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


124 


62 


2251 


X81206 


Drosophila hydei 


histone H3.3 


101 


71 


2252 


AF155581 


Danio rerio 


proteasome subunit beta 7 


92 


52 


2253 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


172 


60 


2254 


AF084225 


Homo sapiens 


cytochrome P450 2E1 


114 


46 


2255 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


136 


64 


2256 


AT 1 "> O /I 1 

AL132841 


Caenorhabditis 
elegans 


Y15E3A.3 


147 


90 


2257 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


297 


59 




U9j563 


Homo sapiens 


putative pi 50 


218 


66 


2259 


-L22650 


Homo sapiens 


early lymphoid activation protein 


82 


55 


2260 


AF194537 


Homo sapiens 


NAG 13 


117 


56 


2261 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


130 


82 


2262 


G00366 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4447. 


86 


66 


2263 


AL021918 


Homo sapiens 


b34I8. 1 (Kruppel related Zinc Finger 
protein 184) 


372 


61 


2264 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


122 


71 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2265 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


123 


79 


2266 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
bEQ ID NO: 163. 


204 


95 


226/ 


T OA AO A 


Chlamydomonas 
reinhardtii 


amino acid feature: Rod protein 
domain., aa 266 .. 468; amino acid 
feature: globular protein domain, aa 32 

9A<\ 


141 


58 


2268 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoaeu. oy gene z bey id jn{J.zdu. 


499 


84 


2269 


A11693 


Homo sapiens 


start codon not included 


594 


87 


OO^A 


A Ttf\ 1 /I ^<A 


Homo sapiens 


rviAAuco4 protein 


141 


63 


2271 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

Ol /JO 

o 14o. 


112 


60 


2272 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


118 


54 


22 / J 


Y20852 


Homo sapiens 


Human neurofilament-H mutant protein 
fragment 1 L 


125 


35 


2274 


G03453 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

/Jo4. 


92 


66 


2275 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ull02, variant a dna, 
complete virion genome. 


198 


64 


22 /D 


U 15o4 / 


Mus musculus 


reverse transcriptase 


100 


72 


22 / / 


AJUU4o 10 


Zea mays 


cytochrome P450 monooxygenase 


129 


43 


2278 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A A A 

44Uy. 


in 


67 


zz /y 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
4670. 


1 12 


68 


on qa 
z/oU 


L,Z / 4zo 


Homo sapiens 


reverse transcriptase 


1 CA 

150 


60 


2281 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


142 


40 




U4yy /3 


Homo sapiens 


URr 1 ; JVLhR37; putative transposase 
similar to pogo element 


321 


54 


2283 


D00570 


Mus musculus 


open reading frame (251 AA) 


169 


56 


on o /i 

2284 


\/ AO O OO 

Y 02887 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


86 


54 


2285 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


144 


50 


2286 


AF1 19901 


Homo sapiens 


PR02831 


116 


82 


22o / 


ouj /y / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/o / o. 


126 


73 


zzoo 


Ar i JUUoy 


— j 

Homo sapiens 




1 AO 

102 


75 


2289 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AA ft/l 
4454. 


113 


70 




UUZJJO 


oomo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


1 T7 
12 / 


64 






Homo sapiens 


chromosomal protein 


137 


53 


zzyz 


ATT1 £1 'i s£ 

Ar loU jo 


Homo sapiens 




1 AA 
100 


57 


2293 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

AAfiQ 
HH\Jy. 


138 


68 


2294 


X03145 


Homo sapiens 


pot. ORF I 


120 


43 


zzyo 




Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


83 


52 


99Q6 

ZZ7U 


ATT (\OAA^< 


noino sapiens 


"FT TA 00/1/7 nr^tAin 

ri^juuu'H-/ protein 


QQ 

yo 


aa 


2297 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


123 


73 


2298 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


349 


73 


2299 


GO 1984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


109 


61 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2300 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


100 


81 


2301 


AF251290 


Plasmodium 
falciparum 


glutamic acid-rich protein 


112 


40 


2302 


A TT 1 f foil 

Ar 155232 


Pisum sativum 


extensin 


89 


36 


2303 


Ab 130089' 


Homo sapiens 


PRO2550 


83 


50 


2304 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


84 


51 


2305 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


86 


30 


2306 


AP002031 


Arabidopsis 
thaliana 


gene_jd:K3D20.3~ 


168 


44 


2307 


AF157321 


Homo sapiens 


30kDa protein 


309 


64 


2308 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


170 


57 


2309 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


198 


42 


23 20 


Y27607 


Homo sapiens 


Human secreted protein encoded by 
gene No. 41. 


207 


100 


2311 


AF130089 


Homo sapiens 


PRO2550 


106 


66 


2312 


G00437 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4518. 


117 


60 


2313 


R10755 


Homo sapiens 


Non-A non-B hepatitis specific 
antigenic protein encoded by 
phageclone lambda HC2533. 


114 


80 


2314 


AB032910 


Hylobates 
muelleri 


dopamine receptor D4 


108 


40 


2315 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


159 


59 


2316 


AK024455 


Homo sapiens 


FLJ00047 protein 


134 


58 


2317 


AF238235 


Entamoeba 
histolytica 


diaphanous protein 


103 


51 


2318 


U15647 


Mus musculus 


reverse transcriptase 


101 


37 


2319 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


117 


51 


2320 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


163 


78 


2321 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


97 


48 


2322 


D00570 


Mus musculus 


open reading frame (196 AA) 


122 


39 


2323 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


122 


75 


2324 


G02639 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6720. 


118 


79 


2325 


U44838 


Glycine max 


extensin 


126 


36 


2326 


Y87103 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 142. 


99 


59 


2327 


Y86573 


Homo sapiens 


Human gene 91 -encoded protein 
fragment, SEQ ID NO:490. 


408 


79 


2328 


AF 194537 


Homo sapiens 


NAG 13 


181 


66 


2329 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


405 


75 


2330 


X53581 


Rattus 
norvegicus 


ORF4 


116 


45 


TOO 1 


L20953 


Homo sapiens 


chromosomal protein 


108 


67 


2332 


L27428 


Homo sapiens 


reverse transcriptase 


117 


76 


,2333 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


84 


50 


2334 


AF191687 


Homo sapiens 


alanine-glyoxylate aminotransferase 
homolog 


100 


37 


2335 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


84 


68 
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SEQ ID 
NO: 


Accession No* 


Species 




Smith- 

Waterman 

Score 


OA 
/o 

Identit 

y 


2336 


AL3901 14 


Leishmania 
major 


extremely cysteine/valine rich protein 


106 


53 


2337 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


168 


65 


2338 


R95913 


Homo sapiens 


Neural thread protein. 


164 


67 


2339 


AF090942 


Homo sapiens 


PRO0657 


122 


81 


2340 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


107 


90 


2341 


AF090942 


Homo sapiens 


PRO0657 


124 


60 


2342 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


126 


83 


2343 


ABO 13454 


Rattus 
norvegicus 


NaPi-2 beta 


143 


77 


2344 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


127 


73 


2345 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


146 


45 


2346 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


120 


40 


2347 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


109 


55 


2348 


Y36203 


Homo sapiens 


Human secreted protein #75. 


138 


78 


2349 


L26953 


Homo sapiens 


chromosomal protein 


131 


57 


2350 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


120 


65 


2351 


S58722 


Homo sapiens 


X-linked retinopathy protein {C- 
terminal, clone XEH.Sc} 


144 


58 


2352 


AF130051 


Homo sapiens 


PRO0898 


155 


70 


2353 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


134 


76 


2354 


L27428 


Homo sapiens 


reverse transcriptase 


141 


71 


2355 


L26953 


Homo sapiens 


chromosomal protein 


128 


66 


2356 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


148 


75 


2357 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


115 


62 


2358 


B12310 


Homo sapiens 


Human secreted protein encoded by 
gene 10 clone HDPGP94. 


115 


50 


2359 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


103 


43 


2360 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


82 


80 


2361 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


95 


72 


2362 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


117 


41 


2363 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


98 


80 


2364 


AL451017 


Neurospora 
crassa 


related to UI SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


152 


45 


2365 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


85 


55 


2366 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


101 


77 


2367 


AF093097 


Homo sapiens 


putative RNA-binding protein Q99 


248 


97 


2368 


U52077 


Homo sapiens 


mariner transposase 


227 


74 


2369 


X07882 


Homo sapiens 


Po protein 


102 


38 


2370 


U44838 


Glycine max 


extensin 


102 


32 


2371 


ABO 12223 


Canis familiaris 


ORF2 


158 


60 


2372 


AF025467 


Caenorhabditis 


contains similarity to drosophila DNA- 


104 


42 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






elegans 


binding protein K10 (NID:g8148) 






2373 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


107 


40 


2374 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


106 


67 


2375 


AF1 18086 


Homo sapiens 


PRO 1992 


169 


65 


2376 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


95 


55 


2377 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


121 


59 


2378 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


141 


70 


2379 


R95913 


Homo sapiens 


Neural thread protein. 


120 


50 


2380 


U93572 


Homo sapiens 


putative pi 50 


124 


53 


2381 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


195 


66 


2382 


U52077 


Homo sapiens 


mariner transposase 


282 


67 


2383 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


113 


43 


2384 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


112 


37 


2385 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


88 


75 


2386 


AF130089 


Homo sapiens 


PRO2550 


160 


62 


2387 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


166 


61 


2388 


AF130089 


Homo sapiens 


PRO2550 


103 


71 


2389 


AB027890 


Schizosaccharom 
yces pombe 


Hypothetical protein 


116 


100 


2390 


U93570 


Homo sapiens 


putative pi 50 


151 


54 


2391 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


110 


71 


2392 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


109 


67 


2393 


AE001381 


Plasmodium 
falciparum 


hypothetical protein 


94 


34 


2394 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQIDNO:163. 


156 


62 


2395 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7133. 


57 


56 


2396 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


107 


43 


2397 


AF2 10651 


Homo sapiens 


NAG 18 


162 


54 


2398 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


116 


34 


2399 


S80864 


Homo sapiens 


cytochrome c-like polypeptide 


115 


68 


2400 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


141 


54 


2401 


M13100 


Rattus 
norvegicus 


unknown protein 


148 


46 


2402 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


133 


70 


2403 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


366 


94 


2404 


AF1 13685 


Homo sapiens 


PRO0974 


112 


56 


2405 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


98 


67 


2406 


R41001 


Homo sapiens 


Human myotonic dystrophy gene 
protein. 


207 


68 


2407 


AL390114 


Leishmania 


probable (hhv-6) ull02, variant a 


127 


63 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






major 


DNA, complete virion genome 






2408 


G03101 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7182. 


378 


95 


2409 


Y91452 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO: 125. 


250 


86 


2410 


U72520 


Mus musculus 


mena protein 


115 


40 


2411 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


132 


44 


2412 


AF095770 


Homo sapiens 


PTH-responsive osteosarcoma Dl 
protein 


109 


85 


2413 


AF1 19901 


Homo sapiens 


PR02831 


113 


66 


2414 


R95913 


Homo sapiens 


Neural thread protein. 


93 


57 


2415 


B01372 


Homo sapiens 


Neuron-associated protein. 


104 


83 


2416 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


130 


50 


2417 


J02459 


bacteriophage 
lambda 


K (tail component; 199) 


720 


92 


2418 


J04694 


Mus musculus 


alpha- 1 type IV collagen 


103 


43 


2419 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


145 


61 


2420 


W26496 


Homo sapiens 


CD2 associated intracellular protein 
CAIPLS02-21. 


115 


80 


2421 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


145 


38 


2422 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


158 


73 


2423 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


156 


70 


2424 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


122 


61 


2425 


Y30822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


95 


59 


2426 


U93565 


Homo sapiens 


putative pi 50 


147 


75 


2427 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


246 


53 


2428 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


138 


61 


2429 


Y19767 


Homo sapiens 


SEQ ID NO 485 from W09922243. 


112 


75 


2430 


D13892 


Homo sapiens 


carboxyl methyltransferase 


181 


68 


2431 


X97675 


Homo sapiens 


plakophilin 2b 


131 


65 


2432 


G02896 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6977. 


108 


70 


2433 


X03145 


Homo sapiens 


pot. ORF III 


180 


82 


2434 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


77 


86 


2435 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


421 


67 


2436 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


111 


38 


2437 


AF2 10651 


Homo sapiens 


NAG 18 


128 


72 


2438 


AFO 16099 


Mus musculus 


endonuclease/reverse transcriptase 


232 


46 


2439 


AL 160493 


Leishmania 
major 


probable (hhv-6) ull02, variant a 
DNA, complete virion genome 


122 


53 


2440 


G00376 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4457. 


136 


80 


2441 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


139 


54 


2442 


AF090895 


Homo sapiens 


PRO0117 


115 


61 


2443 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


125 


70 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2444 


G00594 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4675. 


71 


80 


2445 


G03 790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


81 


37 


2446 


AF090895 


Homo sapiens 


PRO0117 


86 


64 


2447 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


103 


86 


2448 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


94 


61 


2449 


L21936 


Homo sapiens 


succinate dehydrogenase flavoprotein 
subunit 


168 


91 


2450 


AF090942 


Homo sapiens 


PRO0657 


134 


71 


2451 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


126 


87 


2452 


K03179 


Homo sapiens 


pro-alpha- 1 type-I collagen 


120 


44 


2453 


AF1 18082 


Homo sapiens 


PRO1902 


143 


46 


2454 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


114 


68 


2455 


AF130089 


Homo sapiens 


PRO2550 


125 


89 


2456 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


134 


71 


2457 


Y36243 


Homo sapiens 


Human secreted protein encoded by 
gene 20. 


93 


75 


2458 


Y09945 


Rattus 
norvegicus 


putative integral membrane transport 
protein 


166 


46 


2459 


AF 130052 


Homo sapiens 


PRO0956 


75 


48 


2460 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


162 


76 


2461 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


123 


69 


2462 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


157 


48 


2463 


X83413 


Human 
herpesvirus 6 


U88 


236 


50 


2464 


G00517 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4598. 


181 


53 


2465 


X83413 


Human 
herpesvirus 6 


U88 


218 


50 


2466 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


147 


57 


2467 


X83413 


Human 
herpesvirus 6 


U88 


196 


53 


2468 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


165 


61 


2469 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


101 


46 


2470 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


221 


56 


2471 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


136 


63 


2472 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ull02, variant a dna, 
complete virion genome. 


122 


94 


2473 


AF161361 


Homo sapiens 


HSPC098 


120 


60 


2474 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


142 


40 


2475 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


130 


58 


2476 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


78 


48 


2477 


AL160371 


Leishmania 


probable (hhv-6) ul 102, variant a 


86 


46 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






major 


DNA, complete virion genome 






2478 


GO 1478 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5559. 


96 


66 


2479 


AL033545 


Arabidopsis 
thaliana 


extensin-like protein 


114 


40 


2480 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ull02, variant a dna 9 
complete virion genome. 


142 


59 


2481 


AL390114 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA ? complete virion genome 


146 


62 


2482 


AL 132902 


Caenorhabditis 
elegans 


Y71A12B.4 


160 


37 


2483 


AF154502 


Homo sapiens 


quiescent cell proline dipeptidase 


439 


88 


2484 


AF0 10326 


Drosophila 
melanogaster 


short form of CHIP 


56 


42 


2485 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dli40 3. 


119 


50 


2486 


W31186 


Homo sapiens 


Human pi 60 polypeptide 160.2. 


115 


60 


2487 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


103 


74 


2488 


L26953 


Homo sapiens 


chromosomal protein 


129 


68 


2489 


Y21418 


Homo sapiens 


Human high mobility group protein 
HMGI-C mutant fragment 2. 


96 


46 


2490 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


366 


94 


2491 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


158 


76 


2492 


U12707 


Homo sapiens 


Wiskott-Aldrich syndrome protein 


144 


47 


2493 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 1 7. 


117 


81 


2494 


X92485 


Plasmodium 
vivax 


pval 


116 


64 


2495 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


92 


64 


2497 


AF161483 


Homo sapiens 


HSPC134 


211 


78 


2498 


X05006 


Homo sapiens 


S-protein 


298 


92 


2499 


AB032911 


Hylobates agilis 


dopamine receptor D4 


89 


42 


2500 


M98502 


Mus musculus 


pMLZ-4 


325 


90 


2501 


Y94920 


Homo sapiens 


Human secreted protein clone 
pm412_12 protein sequence SEQ ID 
NO:46. 


569 


85 


2502 


AB049054 


Homo sapiens 


brain link protein- 1 


224 


90 


2503 


AF064604 


Homo sapiens 


KE03 protein 


224 


37 


2504 


Z69727 


Schizosaccharom 
yces pombe 


putative dna-directed rna polymerase iii 
130 kd polypeptide (ec 2.7.7.6) 


384 


61 


2505 


Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


147 


43 


2506 


A T/~\ 1 r\zif\ A 

A JO 10604 


Mus musculus 


L-Sox5 protein 


366 


87 


2507 


X83413 


Human 
herpesvirus 6 


U88 


203 


46 


2508 


AJ277425 


Globodera 
pallida 


putative cuticular collagen 


125 


40 


2509 


ABO 17919 


Homo sapiens 


peptidylarginine deiminase type V 


148 


81 




Ar 00 1947 


Homo sapiens 


U4/U6~associated RNA splicing factor 


583 


97 


2511 


AJ238520 


Homo sapiens 


putative transcription factor-like 
nuclear regulator 


722 


100 


2512 


AL390736 


Homo sapiens 


bA209JI 9. 1 . 1 (GW1 12 protein) 


557 


84 


2513 


D84223 


Homo sapiens 


leucyl tRNA synthetase 


1113 


100 


2514 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


174 


56 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2515 


U60803 


Homo sapiens 


clathrin heavy chain 2 


1 1 1 


92 


2516 


AJ388557 


Canis familiaris 


zinc finger protein 


826 


56 


2517 


AB02725 1 


Homo sapiens 


zinc finger protein (ZFD25) 


631 


85 


2518 


AK023160 


Homo sapiens 


unnamed protein product 


168 


54 


2519 


D87326 


Mus musculus 


GSG2 


575 


73 


2520 


G03371 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7452. 


100 


42 


2521 - 


X67688 


Homo sapiens 


transketolase 


120 


66 


2522 


AF074086 


Homo sapiens 


protease 


390 


86 


2523 


AF220509 


Homo sapiens 


transcription associated factor 
TAFII31L 


801 


99 


2524 


AL078463 


Homo sapiens 


dJ365I19.1 (KIAA0456) 


374 


92 


2525 


AF038995 


Mus musculus 


putative RNA helicase RCK 


160 


93 


2526 


M60618 


Homo sapiens 


nuclear autoantigen 


116 


75 


2527 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


101 


57 


2528 


X68790 


Homo sapiens 


Bactericidal /Permeability Increasing 
Protein 


136 


90 


2529 


AO 1592 


Homo sapiens 


haemoglobin A beta chain 


508 


94 


2530 


U89277 


Homo sapiens 


polyhomeotic 1 homolog 


404 


79 


2531 


X90845 


Rattus 
norvegicus 


alphall spectrin 


507 


87 


2532 


AL137081 


Arabidopsis 
thaliana 


phenyl alanin e-tRN A synthetase-like 
protein 


178 


33 


2533 


Y 18046 


Homo sapiens 


FGFR1 oncogene partner (FOP) 


363 


100 


2534 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


113 


42 


2535 


X61047 


Hydra sp. 


mini-collagen 


102 


38 


2536 


AB006330 


Mus musculus 


SOX5 


559 


94 


2537 


AC006283 


Arabidopsis 
thaliana 


Bn/Spm-like transposon protein 


149 


33 


2538 


AF 196779 


Homo sapiens 


JM11 protein 


139 


57 


2539 


AK000741 


Homo sapiens 


unnamed protein product 


233 


47 


2540 


Y 18046 


Homo sapiens 


FGFR1 oncogene partner (FOP) 


315 


84 


2541 


U54996 


Homo sapiens 


HZW10 


371 


81 


2542 


G0263 1 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6712. 


99 


45 


2543 


M95610 


Homo sapiens 


alpha-2 IX collagen 


93 


34 


2544 


AF071173 


Mus musculus 


Herc2 


349 


82 


2545 


AB000516 


Homo sapiens 


DSIFpl60 


600 


93 


2546 


AL35517S 


Homo sapiens 


dJ947L8.1.6 (novel CUB and Sushi 
(SCR repeat) domain protein) 


395 


90 


2547 


U93574 


Homo sapiens 


putative pi 50 


235 


76 


2548 


X80035 


Oryctolagus 
cuniculus 


cysteine rich hair keratin associated 
protein 


99 


31 


2549 


U58088 


Homo sapiens 


Hs-CUL-2 


592 


86 


2550 


L26953 


Homo sapiens 


chromosomal protein 


131 


63 


2551 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


99 


61 


2552 


U39412 


Homo sapiens 


alpha SNAP 


159 


54 


2553 


AF248651 


Homo sapiens 


RNA-binding protein BRUNOL4 


569 


93 


2554 


AJ006519 


Rattus 


ASIC-beta 


164 


73 






norvegicus 








2555 


U93570 


Homo sapiens 


putative pi 50 


169 


52 


2556 


AF1 04328 


Arabidopsis 
thaliana 


cell wall-plasma membrane linker 
protein homolog 


105 


29 


2557 


AB028975 


Homo sapiens 


KIAA1052 protein 


610 


83 


2558 


AJ245621 


Homo sapiens 


CTL2 protein 


286 


47 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2559 


AF0 90944 


Homo sapiens 


PRO0663 


174 


78 


2560 


Y17832 


Human 
endogenous 
retrovirus K 


pol protein 


411 


63 


2561 


AF1 09907 ' 


Homo sapiens 


S164 


279 


44 


2562 


A09561 


synthetic 
construct 


human serum albumin 


680 


91 


2563 


U48359 


Gallus gallus 


kinesin light chain 


575 


81 


2564 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


435 


88 


2565 


D38H2 


Homo sapiens 


-V T A TMT 111 1 *J_ it 

NADH dehydrogenase subumt 4 


623 


89 


2566 


AF154916 


Chlamydomonas 
reinhardtii 


variable flagellar number protein 


117 


36 


2567 


AF181657 


Drosophila 
melanogaster 


BcDNA.LD34475 


261 


42 


2568 


M12530 


Homo sapiens 


transferrin precursor 


693 


87 


2569 


U96915 


Homo sapiens 


sin3 associated polypeptide pi 8 


651 


93 


2570 


R74205 


Homo sapiens 


Human death associated protein DAP- 
2. 


285 


27 


2571 


AL078593 


Homo sapiens 


dJ210BLl (KIAA0680) 


107 


37 


2572 


AF293405 


Phaseolus 
coccineus 


seed-micropylar-endothelium-specific 
protein 


88 


39 


2573 


G03114 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7195. 


110 


49 


2574 


AB024601 


Pseudornonas 
aeruginosa 


uridylyl transferase 


120 


37 


2575 


AB050893 


Anadara nodifera 


cytochrome c oxidase subunit 1 


111 


79 


2576 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


130 


72 


2577 


Ml 7697 


Homo sapiens 


glutamate dehydrogenase 


369 


61 


2578 


AF045640 


Caenorhabditis 
elegans 


CI 1D2.4 gene product 


264 


43 


2579 


U71382 


Homo sapiens 


OB binding protein- 1 


273 


86 


2580 


AF 116661 


Homo sapiens 


PR01438 


114 


61 


2581 


AK000496 


Homo sapiens 


unnamed protein product 


153 


55 


2582 


A67510 


Mus musculus 


MUS MUSCULUS GENOMIC DNA 
CONTAINING N ALLELE OF FV1 
GENE. 


190 


38 


2583 


"V T A ^ or 1 

Y44851 


Homo sapiens 


X T. ^ , _ /~tx\^ f~\ T X" S~ j • 

Human CD39-L66 protein. 


205 


97 


2584 


AB029151 


Homo sapiens 


D29 


253 


73 


2585 


AF039023 


Homo sapiens 


Ran-GTP binding protein; RanBP6 


765 


94 


2586 


AF054180 


Homo sapiens 


hematopoietic cell derived zinc finger 
protein 


116 


34 


2587 


AF283645 


Homo sapiens 


folate transporter/carrier 


580 


82 


2588 


AF090930 


Homo sapiens 


PRO0478 


132 


67 


2589 


AC002339 


Arabidopsis 
thaliana 


putative ABC transporter 


238 


34 


2590 


M34668 


Homo sapiens 


protein tyrosine phosphatase (EC 
3.1.3.48) 


627 


86 


2591 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


141 


63 


2592 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40_3. 


142 1 


63 






Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


146 


54 


2594 


AF047437 


Homo sapiens 


sperm acrosomal protein 


526 


74 


2595 


U28789 


Mus musculus 


PACT 


528 


71 


2596 


AB0023 66 


Homo sapiens 


KIAA0368 


615 


83 


2597 


AL049610 


Homo sapiens 


dJ1055C14.2 (KIAA0026 
(transcription factor-like protein 


704 


93 
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SEQ ID 
NO: 


Accession No- 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








MRGX)) 






2598 


G01279 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

5^60. 


385 


79 






Homo sapiens 


T>T>/O.A 1 17 


1 /to 
14z 


61 


2600 


D29763 


Mus musculus 


seizure-related gene product 6 
precursor 


119 


76 


zoU 1 


T TO A 1 <. Q 


Homo sapiens 


oLrwo 


1 r*o 
108 


53 


2o02 




Homo sapiens 


NADH dehydrogenase subunit 2 


O A O 

242 


78 




JJ5o 1 Iz 


Homo sapiens 


NADH dehydrogenase subunit 2 




73 


2604 


T O/l Q A/1 


Homo sapiens 


pzi 


OCA 

259 


z:o 

62 


2605 


vioi r»o 
Y 12102 


Homo sapiens 


Human 5 1 EST secreted protein SEQ ID 
NO: 415. 


132 


92 


2606 


U79284 


Homo sapiens 


SEC14L 


614 


81 


2607 


A T?0 AO O £ 

At 202635 


Homo sapiens 


Tin i o r\f\ 

PP1200 


105 


47 


2608 


AF277374 


Homo sapiens 


enhancer of polycomb 


254 


85 


2609 


°V/"iO A O A 

X03484 


Homo sapiens 


rat protein (aa 1-648) 


600 


82 


2610 


U15637 


Homo sapiens 


CD40 binding protien 


465 


82 


2611 


t t /t r\o /T c 

U40265 


Trypanosoma 
cruzi 


ATPase subunit 6 


98 


31 


2612 


AJr 090942 


Homo sapiens 


T>T> f~\ A^CI 

PRO0657 


125 


48 


O/C 1 o 

2613 


V 00662 


Homo sapiens 


cytochrome oxidase I 


605 


82 


O/C 1 A 

2014 


T T/1 /CAO A 

U46yz0 


Homo sapiens 


metaxin 


748 


94 


O/C 1 c 

2615 


Y19757 


Homo sapiens 


SEQ ID NO 475 from W09922243. 


113 


87 


2616 


AK025047 


Homo sapiens 


unnamed protein product 


173 


58 


2617 


D13866 


Homo sapiens 


alpha-catenin 


569 


96 


2618 


T TO AO/ 

U20536 


Homo sapiens 


cysteine protease Mch2 isoform alpha 


588 


87 


2619 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


608 


86 


2620 


AF190168 


Homo sapiens 


serum albumin precursor 


522 


78 


2621 


A06977 


Homo sapiens 


albumin 


607 


93 


2622 


R14584 


Homo sapiens 


TGF beta 1 binding protein encoded by 
clone BPA 13. 


339 


81 


2623 


X56698 


Xenopus laevis 


42Sp48 


117 


47 


2624 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


131 


57 


2625 


X07881 


Homo sapiens 


proline-rich protein Gl 


I 16 


32 


2626 


IT T A r>o c •% 

W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


89 


58 


2627 


A00279 


synthetic 
construct 


Human serum albumin 


564 


95 


2o2o 


A T/" /AO 1 /Tin 

AK021613 


Homo sapiens 


unnamed protein product 


O 1 A 

214 


70 


2629 


M23613 


Homo sapiens 


nucleophosmin 


486 


82 


2630 


AF243424 


Homo sapiens 


SG2NA beta isoform 


256 


98 


2631 


000506 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4587. 


81 


60 


2632 


A06977 


Homo sapiens 


albumin 


457 


74 


2633 


A06977 


Homo sapiens 


albumin 


563 


93 


O /CO A 

2634 


X jf O 1 AOO 

M81088 


Rattus 
norvegicus 


EF-1 -alpha 


165 


68 


26 d 5 


AL3 59587 


Homo sapiens 


hypothetical protein 


496 


82 


2636 


A0j758 


Homo sapiens 


serum albumin 


576 


91 


2637 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


39 


2638 


U93563 


Homo sapiens 


putative pi 50 


186 


38 


2639 


U68729 


Meloidogyne 
incognita 


cuticle preprocollagen 


113 


34 


2640 


AB033056 


Homo sapiens 


KIAA 1230 protein 


269 


94 


2641 


AF1 16712 


Homo sapiens 


PR02738 


115 


61 


2642 


W48353 


Homo sapiens 


Human breast cancer related protein 


124 


72 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








BCFLT2. 






9643 


G02455 


xnuillU oa.|JlGJ.lo 


xiuiiiaii bct/rcicti proiem, ocy invj. 
6536. 




JH- 


2644 


G03438 


TTomo saniens 


Human serrated nrotein SFO TF) NO m 
7519. 


127 


fid 


2645 


AK025116 


Homo sapiens 


unnamed nrotein nrorluct 


104 




2646 


X92485 


Plasmodium 
vivax 


pval 


100 


47 


2647 


AF130089 


Homo ^anien^ 


PRO2550 


122 




2648 


AF 194537 


T-fomo ^aniens 


NAG 13 


779 




2649 


Y14482 


Homo sapiens 


Fragment of human secreted protein 

encorlf^d hv cyi^nf* 17 


90 


66 


2650 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


148 


63 


2651 


AF1 16661 


Homo sapiens 


PR01438 


112 


46 


96 S9 


TT63S49 


Hnmn ennipne 
XTLUiliU bdJJlCllb 


r f\L piULClII 






2653 


AL3 59782 


Trypanosoma 


possible (hhv-6) ull02 ? variant a dna, 
cuiiipicit; virion genuine. 


133 


46 


2654 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
nomoiogue ^r^^i^ protein sequence. 


139 


72 


2655 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

66 1 3 

DO iJ, 


164 


72 


2656 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
Vimcm^ geric preouueu 


145 


51 


2657 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4/16. 


166 


64 


2658 


B25722 


Homo sapiens 


Human secreted protein sequence 
ciityuucQ uy gene y z>&\^j xs-J i\kj. i h. 


91 


88 


96SQ 


XQ94&S 


r la mi luuiuii i 

vivax 


pval 

§ 


1 A\ 


OK) 


9660 


V761 OR 
i /oi yo 


T— \c\XY\C\ C C> T\ 1 h»TI C 
I1UH1U bctplClIb 


T- Til tfYi fan OA/^roton nrnfoin on/>A/4arl 7 

r1u11id.11 bcL/rcieu protein encoQca oy 

CrpriP 7^S 
genu / .J . 


Oo 


oo 


2661 


Y30829 




Human Q&prpt&ft r.rr_fpf n i=»nr*nr..=«r1 frnm 
ppnp 19 




SO 


2662 


AF090931 


Homo ^aniens 


PRO0483 


1 14 


74. 


2663 


G03807 


Homo ^aniens 


Hum an senreteH nrrvre.in SIFO TO T\FO* 

X1.LJ.1JLICI.11 JijUl OUL/U JL^l v/Lvvlllj U.l_jV-£ 1 I. J llVy. 

7888. 


1 70 


S7 


2664 


AF090930 


Homo saoiens 


PRO0478 


170 


7R 


2665 


V00662 


Homo sanien*; 

A lwlliv i3CXL^l.wX10 


TTRT^ 1 riSTADT-T d^hvdrncpna^p QiiHnnit^ 


460 


77 


2666 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


368 


67. 


2667 


AF090931 


Homo ^aniens 


PRO0483 




o o 


2668 


AF010144 


T-Tomo santPiTs 

JLJlV/llIV/ DCipivUS 


lieuxvjiicii ujuuciu. u-'ivji.eiii fxi-j r v in ir 


1 ? 1 




2669 


G00397 


Homo ^anien*? 


Riiman Qprrpfpd TM'rvtpin ^IThO TO TMO* 

4478. 


1 60 


73 


2670 


AF 194537 


Homo <s aniens 


NAG 13 


1 70 


47 


2671 


X86003 


Rattus 
norvegicus 


np,i iron -dp.Hvpd rvrohan rpr.potnr 


104 


SO 


2672 


TJ63542 


A -HJAJ.lv/ OClj-/it'ilO 


A ill jjl wtCLll 


1 30 


7S 


2673 


AF090930 


Homo sanien^ 


PRO047X 


1 30 


60 

v7 


2674 


G03564 


Homo saniens 


T-Tnmfin ^errfted nrotein ST-i'O TO TsTO' 
7645. 


161 


7S 


2675 


A K 02445 S 


T-Tomo carMPnc 


PT T00047 nrntpin 


1 OS 


S3 


2676 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


142 


67 


2677 


AK000496 


Homo sapiens 


unnamed protein product 


no 


67 


2678 


AF1 16715 


Homo sapiens 


PR02829 


151 


78 


2679 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


160 


72 


2680 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


110 


81 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








6613. 






2681 , 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


131 


75 


2682 


U76604 


Homo sapiens 


1 80 kDa bullous pemphigoid antigen 
2/type XVII collagen 


96 


37 


2683 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


85 


55 


2684 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


127 


80 


2685 


AF090895 


Homo sapiens 


PRO0117 


81 


51 


2686 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


119 


52 


2687 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


122 


41 


2688 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


145 


71 


2689 


AF130089 


Homo sapiens 


PRO2550 


149 


78 


2690 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


102 


47 


2691 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


116 


74 


2692 


G03064 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7145. 


130 


60 


2693 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


106 


26 


2694 


AE001158 


Borrelia 
burgdorferi 


conserved hypothetical integral 
membrane protein 


81 


37 


2695 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


162 


76 


2696 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


114 


100 


2697 


AF194537 


Homo sapiens 


NAG 13 


126 


90 


2698 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


140 


69 


2699 


U93563 


Homo sapiens 


putative pi 50 


138 


37 


2700 


AF090928 


Homo sapiens 


PRO0470 


177 


57 


2701 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


130 


75 


2702 


X92485 


Plasmodium 
vivax 


pval 


116 


50 


2703 


AK024455 


Homo sapiens 


FLJ00047 protein 


103 


57 ! 


2704 


D38114 


Gorilla gorilla 


NADH dehydrogenase subunit 2 (ND2) 


137 


81 


2705 


AF090895 


Homo sapiens 


PRO0117 


105 


51 


2706 


AF090944 


Homo sapiens 


PRO0663 


121 


45 


2707 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


104 


57 


2708 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


157 


85 


2709 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


283 


84 


2710 


AF130079 


Homo sapiens 


PR02852 


125 


48 


2711 


M22334 


Homo sapiens 


unknown protein 


131 


50 


2712 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


123 


58 


2713 


S80119 


Rattus sp. 


reverse transcriptase homoiog 


230 


57 


2714 


AF090931 


Homo sapiens 


PRO0483 


98 


85 


2715 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


104 


79 


2716 


L00016 


Homo sapiens 


urf5 


295 


93 


2717 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


135 


82 


2718 


M81321 


Macaca 
fascicularis 


proline-rich protein 


117 


38 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Id en tit 

y 


2719 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


124 


57 


ooon 


AT 'i^G^CO 
AJLjOy /oZ 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


1 An 

147 


56 


OOO 1 

Z/Zl 


VTiOO/1 Q 

YUZ /4y 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


132 


67 


Z I zz 


Yuz /4y 


Homo sapiens 
___ . 


Human secreted protein encoded by 
gene tuu clone riiNriuyo. 


1 OA 

129 


56 


Z f zo 


UD 0 1 1 z 


Homo sapiens 


cytochrome c oxidase subunit 1 


loo 

jyZ 


/TO 


Z f Z^r 


I OOZ4o 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 CO 
1 OZ 


OO 

79 


Z/Zj 


YoOZ4o 


Homo sapiens 


Human secreted, protein HCHrroo, 
SEQ ID NO: 163. 


1 c\ c 

195 


92 


Z /Zv 


Wy fZyj 


Homo sapiens 


An annexin binding protein (NABP-1). 


193 


40 


2727 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


547 


85 


0*70 0 

z/zo 


VUUooz 


Homo sapiens 


cytochrome oxidase I 


500 


92 


2729 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


110 


56 


2730 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


124 


75 


2731 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


113 


57 


2732 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


149 


51 


2733 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


101 


44 


2734 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


288 


81 


2735 


Z74036 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01391 (Collagen triple helix repeat 
(20 copies)), Score=58.9, E- 
vame=3.5e-14,N=3 


117 


37 


Z/30 


(jrU4UoJ 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

O 1 A A 


154 


62 


2737 


AK024455 


Homo sapiens 


FLJ00047 protein 


136 


60 


OOQ Q 


CjOUD / / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

4oj5. 


110 


63 


2739 


AP002460 


Arabidopsis 
thaliana 


gene_id:FlD9.26~miknown protein 


363 


90 


2740 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


358 


69 


z/41 


UUZo^Z 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


143 


64 


Z/4Z 


T 0"7y10<? 

LrZ /4zo 


Homo sapiens 


reverse transcriptase 


119 


45 


2743 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone t± 1 L>AL>z2. 


76 


63 


2744 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJ W 1 8. 


155 


79 


2745 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
00 13. 


114 


51 


07/1/C 
Z/4o 


Y OOZ4o 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


159 


75 


z/4/ 


CjuU4J / 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

H J 10. 


162 


79 


2748 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


107 


49 


2749 


AK024455 


Homo sapiens 


FLJ00047 protein 


165 


69 


2750 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


167 


72 


2751 


M22332 


Homo sapiens 


unknown protein 


208 


46 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2752. 


Kl-3556 


Homo sapiens 


Protein encoded downstream of hhc JM 
oncoprotein . 


IOC 

125 


56 


2753 


R13556 


Homo sapiens 


Protein encoded downstream of hhcJVI 
oncoprotein . 


150 


65 


ZtDH 


rlrZZUZO^f 


-— : 

Homo sapiens 


iViv^o I - 1 


l 


CO 

DZ 


Z / jj 




Homo sapiens 


rJLJUi)U4/ protein 


1 Z5 


CO 


2756 


AF1 30051 


Homo sapiens 


PRO0898 


162 


73 


Z/J / 


Olio 

JJJol IZ 


Homo sapiens 


NADH dehydrogenase subunit 3 


OCA 

259 


86 


Z/jo 


OU3 / 14 


Homo sapiens 


Human secreted protein, bbQ 1JD NO: 

77QC 


1 O A 

124 


58 


2759 


AJ243666 


Homo sapiens 


NICE-5 protein 


118 


84 


Z/oU 


JJiol IZ 


Homo sapiens 


NADH dehydrogenase subunit 4 


533 


90 


2761 


G02482 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
656j>. 


110 


39 


2762 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


257 


88 


2763 


Y73345 


Homo sapiens 


HTRM clone 438283 protein sequence. 


624 


67 


2764 


R13556 


Homo sapiens 


Protein encoded downstream of hhcJM 
oncoprotein. 


156 


45 


2765 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


132 


58 


2766 


X92485 


Plasmodium 
vivax 


pval 


149 


53 


2767 


AF1 13685 


Homo sapiens 


PRO0974 


131 


63 


2768 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


109 


56 


2769 


G03243 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7324. 


154 


71 


2770 


X97707 


Pongo pygmaeus 
abelii 


stopcodon created by 
posttranscriptional polyadenylation 


185 


87 


ooo i 

2771 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene zz clone HMoJWlo. 


110 


59 


LI 1 £ 




Homo sapiens 


Human secreted protein, obQ ID NO: 
4718. 


1 A C 

145 


60 




a t?aaaa./i a 


Homo sapiens 


T>T> AA^T /T^ 


1 1 O 

118 


62 


2114 


L26953 


Homo sapiens 


chromosomal protein 


112 


58 


OOOC 

2775 


A T?A a a A"> a 

Ar 0909:>0 


Homo sapiens 


"AT> r\ A /• 1 o 

PRO 047 8 


127 


59 


2776 


AK024455 


Homo sapiens 


FLJ00047 protein 


154 


56 


2777 


R5984^ 


Homo sapiens 


ApoE4Lx2 protease. 


103 


42 


2778 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


101 


55 


2779 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene zz clone HMSJW18. 


131 


64 


2780 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

oo c o 


89 


56 


Z/oZ 


r^i q 1 1 1 


1 — j 

Pan troglodytes 


NADH dehydrogenase subunit 4 


194 


97 


Z/oo 


Y UZ / 4V 


Homo sapiens 


Human secreted protein encoded by 
gene iUU clone riJNrluyo. 


124 


68 


2784 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


123 


49 


Z/OJ 


U4JJOU 


Peromyscus 
maniculatus 


reverse transcriptase 


1 

175 


C A 

50 


2786 


W58700 


Homo sapiens 


Human ST-1 partial sequence. 


219 


100 


Z/o7 


O A<CO A A 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


137 


90 


/ oo 


ujo 1 IZ 


Homo sapiens 


cytochrome c oxidase subunit 3 


CIA 


/O 


2789 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


104 


44 


2790 


AF130089 


Homo sapiens 


PRO2550 


121 


39 


2791 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


135 


63 


2792 


G03052 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


124 


50 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








7133. 






2793 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


153 


80 


2794 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


461 


84 


2795 


AF090931 


Homo sapiens 


PRO0483 


114 


76 


2796 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


107 


53 


2797 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


114 


78 


2798 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871, 


115 


71 


2799 


G02515 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6596. 


113 


79 


2800 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


117 


48 


2801 


X92485 


Plasmodium 
vivax 


pval 


97 


80 


2802 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


471 


80 


2803 


AF130051 


Homo sapiens 


PRO0898 


118 


78 


2804 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


78 


73 


2805 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


143 


69 


2806 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


102 


62 


2807 


K02401 


Homo sapiens 


chorionic somatomammotropin 


543 


88 


2808 


Y18522 


Corvus 
frugilegus 


cytochrome oxidase subunit I 


382 


60 


2809 


GO 1194 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5275. 


555 


93 


2810 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


399 


77 


2811 


AF007826 


Homo sapiens 


bax epsilon 


94 


39 


2812 


AF0 10144 


Homo sapiens 


neuronal thread protein AD7c~NTP 


220 


67 


2813 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


495 


81 


2814 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


112 


48 


2815 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


102 


75 


2816 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


437 


83 


2817 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


143 


66 


2818 


X55733 


Homo sapiens 


initation factor 4B 


458 i 


80 


2819 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


132 


65 


2820 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


347 


68 


2821 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


94 


62 


2822 


R13556 


Homo sapiens 


Protein encoded downstream of hhcJM 
oncoprotein. 


118 


54 


2823 


AF 130079 


Homo sapiens 


PR02852 


143 


54 


2824 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


140 


52 


2825 


Y76198 


Homo sapiens 


Human secreted protein encoded by 
gene 75. 


161 


67 


2826 


AF1 19851 


Homo sapiens 


PRO 1722 


104 


70 


2827 


D49489 


Homo sapiens 


human P5 


523 


91 


2828 


M10546 


Homo sapiens 


cytochrome oxidase I 


206 


81 


2829 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


138 


42 


2830 


AF090931 


Homo sapiens 


PRO0483 


97 


80 


2831 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


144 


65 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








7871. 






2832 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


112 


35 ! 


2833 


X92485 


Plasmodium 

vivax 


pval 


103 


43 


2834 


R32010 


Homo sapiens 


Rpl5-TIA-1. 


152 


54 


2835 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


153 


59 


2836 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


148 


59 


2837 


AF090931 


Homo sapiens 


PRO0483 


149 


81 


2838 


AF130089 


Homo sapiens 


PRO2550 


123 


49 


2839 


AF090942 


Homo sapiens 


PRO0657 


135 


54 


2840 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


124 


54 


2841 


AF090895 


Homo sapiens 


PRO0117 


146 


55 


2842 


AF 132200 


Homo sapiens 


PR01751 


98 


74 


2843 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


194 


75 


2844 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


61 


59 


2845 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


171 


37 


2846 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


155 


69 


2847 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


141 


58 


2848 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


108 


65 


2849 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


441 


77 


2850 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


490 


88 


2851 


K02401 


Homo sapiens 


chorionic somatomammotropin 


488 


90 


2852 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


110 


49 


2853 


U83280 


Leishmania 
donovani 


39 kDa antigen 


131 


47 


2854 


AF090944 


Homo sapiens 


PRO0663 


173 


72 


2855 


V00662 


Homo sapiens 


ATPase 6 


392 


71 


2856 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


147 


50 


2857 


Y85062 


Homo sapiens 


Interleukin 1 converting enzyme 
homologue (ICEL) protein sequence. 


113 


76 


2858 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


178 


84 


2859 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


192 


92 


2860 


Z81068 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF00102 (Protein-tyrosine 
phosphatase), Score=232.1, E- 
value=2.6e-66, N=l~cDNA EST 
yk299h6.3 comes from this 
gene-cDNA EST yk420b4.3 comes 
from this gene-cDNA EST yk439g6.3 
comes from this gene-cDNA EST 
yk299h6.5 comes from this 
gene-cDNA EST yk420b4.5 comes 
from this gene-cDNA EST yk439g6.5 
comes from this gene 


104 


36 


2861 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


444 


85 


2862 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


168 


61 


2863 


U93564 


Homo sapiens 


putative pi 50 


132 


34 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2864 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40_3. 


152 


50 


2865 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


124 


76 


2866 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


145 


46 


2867 




Homo sapiens 


PKO0478 


161 


61 


2ooo 


Yj0822 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


1 17 


48 


2869 


U15647 


Mus musculus 


reverse transcriptase 


197 


40 


2870 


U09500 


Homo sapiens 


cytochrome b 


394 


68 


2871 


T lAATAA 

U09500 


Homo sapiens 


cytochrome b 


614 


92 


2872 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


516 


87 


2873 


D38l 12 


Homo sapiens 


NADH dehydrogenase subunit 4 


458 


84 


2874 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


516 


92 


2875 


Y30S22 


Homo sapiens 


Human secreted protein encoded from 
gene 12. 


87 


55 


2876 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


624 


96 


2877 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


160 


58 


2878 


U32174 


Dictyostelium 
discoideum 


non-receptor tyrosine kinase 


120 


33 


2879 


D38112 


Homo sapiens 


ATPase subunit 6 


141 


54 


2880 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


140 


84 


2881 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


93 


79 


2882 


V00662 


Homo sapiens 


ATPase 6 


130 


87 


2883 


AL390114 


Leishmania 
major 


probable proteophosphoglycan 


137 


50 


2884 


AF1 19851 


Homo sapiens 


PRO 1722 


79 


55 


2885 


X924S5 


Plasmodium 
vivax 


pval 


106 


48 


2886 


V00662 


Homo sapiens 


ATPase 6 


127 


87 


2887 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


191 


83 


2888 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: „ 
7870. 


138 


58 


2889 


X92485 


Plasmodium 
vivax 


pval 


107 


63 


2890 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


163 


89 


2891 


AF090942 


Homo sapiens 


PRO0657 


72 


37 


2892 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


136 


56 


2893 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


139 


66 


2894 


AK024455 


Homo sapiens 


FLJ00047 protein 


139 


57 


2895 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


111 


44 


2896 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


182 


62 


2897 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


143 


73 




UJ o 1 LJ, 


— — : 

Homo sapiens 


cytochrome c oxidase subunit 3 






2899 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


366 


68 


2900 


X07882 


Homo sapiens 


Po protein 


146 


41 


2901 


V00662 


Homo sapiens 


cytochrome B 


517 


91 


2902 


D38115 


Pongo pygmaeus 


cytochrome c oxidase subunit 3 


378 


67 


2903 


AF090931 


Homo sapiens 


PRO0483 


117 


67 


2904 


AF003540 


Homo sapiens 


Krueppel family zinc finger protein 


99 


54 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2905 


AF130079 


Homo sapiens 


PR02852 


111 


55 


290o 


Lool 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


427 


79 


2yu / 




Homo sapiens 


nni onA 


140 


66 




Ar JjU 


Bodo saltans 


NADH dehydrogenase subunit 5 


1 lo 


41 






Homo sapiens 


FAP protein 


155 


75 




"Pv^ Q 1 1 A 


Gorilla gorilla 


cytochrome c oxidase subunit 1 (COI) 


A A£. 

44o 


72 


zy i 1 


AJrUuZDil 


Arabidopsis 
ttialiana 


gene_ia.K3JD2U.^~ 


1 1 o 
1 18 


35 


0019 
.Zi? 1Z 


1Z 


Homo sapiens 


_ : 

cytochrome c oxidase subunit 1 


A 1 ^ 




zy i j 


UjoI 10 


Pan pamscus 


cytochrome c oxidase subunit 1 


4o9 


0"» 

OJ 


2914 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


480 


80 


29 Id 


TVO Q 1 1 O 

L>3ol 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


A O O 

488 


86 


OO 1 c 

2916 


T"V~> Olio 

Dial 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


519 


80 


2917 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


172 


80 


2918 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


512 


75 


2919 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


458 


79 


2920 


G03797 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7878. 


152 


65 


292 1 


\ T {\f\/~- r_r^ 

V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


324 


87 


2922 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


530 


84 


2923 


Ar 157^21 


Homo sapiens 


30 kDa protein 


370 


68 


2924 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


174 


71 


2925 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 4 


503 


88 


2926 


U47624 


Xenopus laevis 


alpha(E)-catenin 


243 


94 


2927 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


135 


47 


2928 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


476 


78 


2929 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


465 


81 


2930 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


151 


58 


2931 


D16480 


Homo sapiens 


enoyl-CoA hydratase/3-hydroxyacyl- 

Alii ^ 1 1 • . 

CoA dehydrogenase alpha-subunit of 
trifunctional protein 


345 


67 


2932 


T~\0 O 1 1 O 

D381 12 


Homo sapiens 


"VTA TXTlr _J 1 J 1 *. A 

NADH dehydrogenase subunit 4 


212 


95 


2933 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


123 


69 


2934 


X83427 


Ornithorhynchus 
anatinus 


cytochrome c oxidase subunit 1 


460 


82 


2935 


D381 13 


Pan troglodytes 


cytochrome c oxidase subunit 1 


527 


83 


2936 


D38 112 


Homo sapiens 


NADH dehydrogenase subunit 4 


448 


92 


2937 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


183 


77 


2938 


X15081 


Crithidia 
fasciculata 


MURF2 protein (AA 1-348) 


105 


42 


2939 


AF 194537 


Homo sapiens 


NAG 13 


151 


52 


2940 


AJ271872 


Nicotiana 
sylvestris 


extensin 


165 


37 


2941 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) j 


134 


91 


2942 


L>381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


387 


69 


2943 


r\'t o i 1 o 

D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


509 


79 


TO/1 /I 


X C>Oz45 


Homo sapiens 


Human secreted protein HCHrroo, 
SEQ ID NO: 163. 


IUd 


95 


2945 


J01415 


Homo sapiens 


MTND4 


429 


76 


2946 


AC010793 


Arabidopsis 
thaliana 


F20B17.16 


99- 


41 


2947 


AF116712 


Homo sapiens 


PR02738 


132 


60 


2948 


X89658 


Homo sapiens 


CAP- 18 protein 


136 


49 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2949 


V00662 


Homo sapiens 


cytochrome oxidase I 


449 


86 


2950 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


438 


82 


2951 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


363 


76 


2952 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RHAla (AF07868j>) 


129 


92 


2953 


X56015 


Crithidia 
on c ope It i 


NADH dehydrogenase subunit 5 


102 


32 


2954 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


HO 


68 


2955 


Y76287 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 20. 


345 


78 


2956 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


340 


71 


2957 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


456 


83 


2958 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


102 


76 


2959 


G025j2 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


96 


84 


2960 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


400 


70 


2961 


X69978 


Homo sapiens 


XP-G factor 


539 


88 


2962 


AF203687 


Homo sapiens 


prolactin regulatory element-binding 
protein 


453 


77 


2963 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


561 


83 


2964 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


187 


92 


2965 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40_3. 


1 19 


63 


2966 


U52077 


Homo sapiens 


mariner transposase 


469 


83 


2967 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCAC J8 1 . 


130 


53 


2968 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


103 


42 


2969 


X92485 


Plasmodium 
vivax 


pval 


144 


42 


2970 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


127 


63 


2971 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


129 


63 


2972 


Y17221 


Homo sapiens 


Human secreted protein (clone fk3l7~ 
3). 


91 


56 


2973 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


101 


72 


2974 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A AO A 

44 84. 


130 


68 


2975 


APIO Anon 

Ar 130089 


Homo sapiens 


T>D AO CCA 


167 


46 


2976 


V 00662 


Homo sapiens 


cytochrome oxidase I 


539 


79 


29/7 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


121 


44 


29/8 


Y 86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63. 


126 


72 


29/9 


T TOO C /: O 


Homo sapiens 


putative pi 50 


122 


30 


29b0 


\/T1 AAA 

Ml 2099 


Mus musculus 


proline-rich protein 


1 19 


39 


2981 


Z38128 


Mus musculus 


histone HI 


117 


36 


Z9<s2 


Ar 025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein KIO (NID:g8148) 


172 


51 


2983 


AC002291 


Arabidopsis 
thaliana 


extensin 


110 


35 


2984 


M10546 


Homo sapiens 


cytochrome oxidase I 


295 


92 


2985 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63. 


171 


65 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


2986 


U18985 


Homo sapiens 


triadin 


416 


83 


2987 


AF 1 55232 


Pisum sativum 


extensin 


159 


41 


2988 


X03145 


Homo sapiens 


pot. ORF V 


133 


44 


2989 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


565 


84 


2990 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


127 


53 


2991 


D38l 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


538 


83 


2992 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


471 


75 


2993 


D381 13 


Pan troglodytes 


cytochrome c oxidase subunit 1 


511 


72 


2994 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


301 


79 


2995 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


526 


78 


2996 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


634 


82 


2997 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68,« 
SEQ ID NO: 163. 


158 


76 


2998 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


103 


50 


2999 


V00662 


Homo sapiens 


cytochrome oxidase I 


535 


78 


3000 


L38908 


Nicotiana 
tabacum 


extensin 


146 


38 


3001 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


494 


87 


3002 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


111 


66 


3003 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


117 


46 


3004 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


93 


47 


3005 


U43627 


Arabidopsis 
thaliana 


extensin 


118 


37 


3006 


AF1 15549 


Homo sapiens 


Wiskott-Aldrich Syndrome protein 


150 


44 


3007 


G03556 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7637. 


114 


73 


3008 


W54966 


Homo sapiens 


Synthetic human type III coHagen 
SYN-C3. 


111 


39 


3009 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


109 


84 


3010 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


212 


77 


3011 


X6571S 


Primus dulcis 


extensin 


129 


42 


3012 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


71 


76 


3013 


Z38128 


Mus museums 


histone HI 


104 


32 


3014 


X56010 


Sorghum bicolor 


hydroxyproline-rich glycoprotein 


102 


34 


3015 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


80 


3016 


Y34068 


Homo sapiens 


Histone HI isoform HI. 5 pANCA- 
reactive fragment (residues 69-226). 


99 


34 


3017 


X55685 


Lycopersicon 
esculentum 


extensin (class I) 


142 


32 


3018 


AJ133050 


Panulirus argus 


cytochrome c oxidase subunit I 


125 


71 


3019 


G03597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7678. 


175 


68 


3020 


U93564 


Homo sapiens 


putative pi 50 


97 


57 


3021 


D38116 


Pan paniscus 


cytochrome c oxidase subunit 1 


555 


82 


3022 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


147 


76 


3023 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


632 


82 


3024 


AF 155232 


Pisum sativum 


extensin 


181 


42 


3025 


V00662 


Homo sapiens 


cytochrome oxidase I 


567 


85 
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SEQ IB 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


3026 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


130 


58 


3027 


W00838 


Homo sapiens 


Tumour necrosis factor-related gene 
product. 


141 


67 


302 o 


Ar 1 J0089 


Homo sapiens 


PRO2550 


149 


55 


5\)Zy 




Homo sapiens 






76 


3030 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


122 


92 


3031 


T~\i o 1 1 O 

D3ol 12 


Homo sapiens 


NADH dehydrogenase subunit 4 


507 


91 


3032 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 2 


351 


87 


3033 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


99 


75 


3034 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


113 


85 


3035 


U93567 


Homo sapiens 


putative pi 50 


173 


41 


3036 


X92485 


Plasmodium 
vivax 


pval 


122 


41 


3037 


AF202635 


Homo sapiens 


PP1200 


128 


67 


3038 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


151 


83 


3039 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


415 


74 


3040 


D86853 


Catharanthus 
roseus 


extensin 


198 


40 


3041 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


110 


47 


3042 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


377 


95 


3043 


S80119 


Rattus sp. 


reverse transcriptase homolog 


106 


50 


3044 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


137 


55 


3045 


Y36156 


Homo sapiens 


Human secreted protein #28. 


91 


58 


3046 


U93574 


Homo sapiens 


putative pi 50 


177 


49 


3047 


AF063866 


Melanoplus 
sanguinipes 
entomopoxvirus 


ORF MSV233 hypothetical protein 


98 


37 


3048 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


568 


87 


3049 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


144 


63 


3050 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


144 


80 


3051 


AK027208 


Homo sapiens 


unnamed protein product 


145 


72 


3052 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


166 


75 


3053 


AF 118082 


Homo sapiens 


PRO 1902 


84 


50 


3054 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


118 


41 


3055 


X92485 


Plasmodium 
vivax 


pval 


132 


52 


3056 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


117 


74 


3057 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


99 


72 


3058 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


627 


89 


3059 


G00332 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4413. 


98 


57 


3060 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


100 


38 


3061 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


159 


80 


3062 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


529 


80 


3063 


AF130051 


Homo sapiens 


PRO0898 


159 


71 


3064 


AF195418 


Mus musculus 


ODZ3 


386 


94 


3065 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 


109 


51 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








clone HCACJ8L 






3066 


D381 12 


Homo sapiens 


-VTA T~"\T T J 1 J "L " i s\ 

N ADH dehydrogenase subunit 2 


338 


80 


3067 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


120 


50 


j>(J6o 


U 15647 


Mus musculus 


reverse transcriptase 


IT,) 

134 


41 




UJbliZ 


Homo sapiens 


cytochrome c oxidase subunit 3 


397 


92 


3070 


AF 130089 


Homo sapiens 


PRO2550 


149 


55 


3071 


U93565 


Homo sapiens 


putative pi 50 


143 


37 


3072 


G03133 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7214. 


112 


45 


3073 


AF090930 


Homo sapiens 


PRO0478 


135 


78 


3074 


AF130089 


Homo sapiens 


PRO2550 


156 


58 


3075 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


267 


65 


3076 


U83280 


Leishmania 
donovani 


39 kDa antigen 


98 


51 


3077 


Dj>81 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


626 


86 


3078 


AF1 16712 


Homo sapiens 


PR02738 


114 


59 


3079 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


112 


46 


3080 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


492 


80 


3081 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


523 


86 


3082 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


515 


79 


3083 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homolog 


150 


41 


3084 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


191 


97 


3085 


U12690 


Homo sapiens 


cytochrome oxidase subunit 11 


537 


78 


3086 


D38112 


Homo sapiens 


ATPase subunit 6 


300 


58 


3087 


X86031 


Sus scrofa 


homologue of proline/arginine-rich 
antibacterial protein 


104 


37 


3088 


K02247 


Rattus 
norvegicus 


proline-rich protein 


108 


41 


3089 


W4835 1 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


130 


48 


3090 


V00662 


Homo sapiens 


cytochrome oxidase I 


597 


84 


3091 


x x i r» r\c\ a 

M 1 8094 


Phaseolus 
vulgaris 


r hydroxyproline-rich glycoprotein 


221 


43 


3092 


D381 13 


Pan troglodytes 


cytochrome c oxidase subunit 1 


392 


59 


3093 


D38116 


Pan paniscus 


cytochrome c oxidase subunit 1 


593 


84 


3094 


D38H2 


Homo sapiens 


cytochrome c oxidase subunit 1 


471 


76 


3095 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


369 


98 


3096 


L26953 


Homo sapiens 


chromosomal protein 


115 


57 


3097 


G00591 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4672. 


124 


57 


3098 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


152 


91 


3099 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


153 


96 


3100 


AC006233 


Arabidopsis 
thaliana 


hypothetical protein 


65 


44 


3101 


AF289098 


Cladrastis 
kentukea 


ENOD2 


107 


35 


3102 


AF155232 


Pisum sativum 


extensin 


142 


43 


3103 


AF130089 


Homo sapiens 


PRO2550 


122 


81 


3104 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


143 


80 


3105 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


104 


59 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


3106 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 12 


76 


3107 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


117 


72 


3108 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


334 


85 


3109 


U43627 


Arabidopsis 
thaliana 


extensin 


112 


30 


3110 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


118 


100 


3111 


AF 2 3 0089 


Homo sapiens 


PRO2550 


137 


37 


3112 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


220 


93 


3113 


A18812 


Brassica napus 


extensin 


128 


32 


3114 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


176 


63 


3115 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


92 


69 


3116 


X92485 


Plasmodium 
vivax 


pval 


148 


45 


3117 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


574 


80 


3118 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIR:JH0162) 


125 


41 


3119 


U43627 


Arabidopsis 
thaliana 


extensin 


129 


32 


3120 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


171 


82 


3121 


Y12950 


Homo sapiens 


Amino acid sequence of a human 
secreted peptide. 


90 


59 


3122 


AF130051 


Homo sapiens 


PRO0898 


134 


82 


3123 


M81321 


Macaca 
fascicularis 


proline-rich protein 


158 


48 


3124 


U93563 


Homo sapiens 


putative pi 50 


126 


32 


3125 


D38114 


Gorilla gorilla 


NADH dehydrogenase subunit 1 (ND1) 


337 


92 


3126 


AF130089 


Homo sapiens 


PRO2550 


155 


77 


3127 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


162 


76 


3128 


AF090901 


Homo sapiens 


PRO0195 


109 


53 


3129 


AF003736 


Murine leukemia 
virus 


reverse transcriptase 


164 


40 


3130 


U25281 


Rattus 
norvegicus 


SH3 domain binding protein 


88 


39 


3131 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


116 


71 


3132 


D86853 


Catharanthus 
roseus 


extensin 


110 


33 


3133 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


506 


76 


3134 


V00662 


Homo sapiens 


cytochrome oxidase I 


583 


88 


3135 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


103 


68 


3136 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


282 


60 


3137 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


119 


54 


3138 


AF1 18082 


Homo sapiens 


PRO 1902 


118 


51 


3139 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


270 


87 


3140 


AL3 59782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


103 


48 
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SEQ 10 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


3141 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit5 


353 


74 


3142 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


126 


77 


ol4o 


Y 86248 


Homo sapiens 


Human secreted protein HCKrroo, 
11J JNU. loo. 


i ir 

1 15 


72 


3144 


AF090930 


Homo sapiens 


PRO0478 


138 


73 


O 1 /I c 


A IT 1 C CO "> O 


Pisum sativum 


extensin 


1 1 A 

110 


34 


3146 


M69008 


Homo sapiens 


alpha- 1 type XIII collagen 


107 


37 


3147 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


103 


63 


3148 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


505 


84 


3149 


M77194 


Rat leukemia 
virus 


polymerase 


167 


35 


3150 


A TIAAO C A O 

AP002543 


Arabidopsis 
thaliana 


gb|AAD23015. 1~ gene _id:F15M7. 16~si 
milar to unknown protein 


105 


31 


3151 


AK024455 


Homo sapiens 


FLJ00047 protein 


109 


59 


3152 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


89 


39 


3153 


J04794 


Homo sapiens 


aldehyde reductase (EC I . I . L2) 


137 


58 


3154 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


115 


70 


3155 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


160 


53 


3156 


X92485 


Plasmodium 
vivax 


pval 


104 


50 


3157 


AF 194537 


Homo sapiens 


NAG 13 


207 


60 


3158 


AF063693 


Mus musculus 


type XIII collagen 


104 


37 


3159 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


130 


91 


3160 


AF020191 


Mus musculus 


proline-rich protein 1 3 


107 


51 


3161 


AFl94:>37 


Homo sapiens 


NAG 13 


170 


76 


3162 


f—-\ ri r 

G025o5 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6616. 


103 


41 


3 163 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


161 


58 


O 1 «C A 

3 164 


Ka)3ZUj 


Homo sapiens 


salivary proline-rich protein precursor 


139 


46 


3165 


U93570 


Homo sapiens 


putative pi 50 


151 


68 


3166 


Y 19767 


Homo sapiens 


SEQ ID NO 485 rrom W09922243. 


144 


71 


3 167 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A O Q 

443 o. 


147 


66 


3 168 


T TO/IOO 

Lz /42o 


Homo sapiens 


reverse transcriptase 


A A 

94 


51 


3169 


X97675 


Homo sapiens 


plakophilin 2b 


103 


76 


3170 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 

od^ 1U iNU.IOJ. 


115 


54 


3171 


AK024455 


Homo sapiens 


FLJ00047 protein 


98 


61 


3 1 /Z 


A T? A 1 A 1 A A 

Ar U 1 0 1 44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


i a a 

146 


38 


O 1 TO 

3 173 


jDi olio 


Pan paniscus 


NADH dehydrogenase subunit 4 


A £n 

467 


81 


Jl /4 


X9Z485 


Plasmodium 
vivax 


pval 


88 


69 


j175 


L26953 


Homo sapiens 


chromosomal protein 


124 


78 


O 1 'O/T 

J 176 


A TO A /1 O 1 /CA 

Ar 042 1 69 


Homo sapiens 


putative ATP-dependent mitochondrial 
RNA helicase 


223 


95 


3177 


AF130079 


Homo sapiens 


PR02852 


96 


44 


Jl /o 




Mus musculus 


cell proliferation related protein CAP 


y I 


Jo 


3179 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


90 


69 


3180 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


146 


78 


3181 


AF 118082 


Homo sapiens 


PRO 1902 


150 


58 


3182 


U60315 


Molluscum 
contagiosum 


MC107L 


101 


44 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






virus subtype 1 








3183 


T T A t~\ f\ *"*T s\ 

U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


148 


61 


3184 


U93565 


Homo sapiens 


putative pi 50 


265 


58 


3185 


Y86248 


Homo sapiens 


Human secreted protem HCHPF68, 
SEQ ID NO: 163. 


117 


79 


3186 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


167 


39 


3187 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
/870. 


160 


63 


3188 


S80119 


Rattus sp. 


reverse transcriptase homolog 


152 


49 


3189 


X05300 


Rattus 
norvegicus 


ribophorin I 


122 


46 


3190 


B01372 


Homo sapiens 


Neuron-associated protein. 


106 


80 


3191 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


130 


83 


3192 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


135 


80 


3193 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


98 


34 


3194 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


140 


87 


3195 


AF022985 


Caenorhabditis 
elegans 


Similar to collagen 


97 


40 


3196 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


452 


81 


3197 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


110 


58 


3198 


X99452 


Lycopersicon 
esculentum 


extensin-like protein Dif54 


100 


36 


3199 


D86853 


Catharanthus 
roseus 


extensin 


119 


36 


3200 


AF025424 


Rattus 
norvegicus 


RNA polymerase I 127 kDa subunit 


215 


74 


3201 


1593563 


Homo sapiens 


putative pi 50 


359 


77 


3202 


K03202 


Homo sapiens 


salivary proline-rich protein precursor 


112 


37 


3203 


G00500 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4581. 


153 


84 


3204 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


128 


39 


3205 


G00637 


Homo sapiens 


Human secreted protem, SEQ ID NO: 
4718. 


154 


76 


3206 


Y01 158 


Homo sapiens 


Secreted protem encoded by gene 1 8 
clone HCACJ8L 


127 


64 


3207 


X97675 


Homo sapiens 


plakophilin 2b 


142 


51 


3208 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


91 


58 


3209 


AB026512 


Ecnomiosa sp. 

T? i 

Ecnl 


cytochrome c oxidase subunit I 


254 


62 


3210 


G0j714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7795. 


163 


59 




D381 12 


Homo sapiens 


ATPase subunit 6 


447 


78 


•ion 

3212 


V00662 


Homo sapiens 


ATPase 6 


482 


83 




UUJ4JJ5 


Homo sapiens 


Mum an secreted protein, oiiQ ID JND. 
7519. 


1 < i 
151 


DO 


3214 


D38112 


Homo sapiens 


ATPase subunit 6 


400 


72 


3215 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


569 j 


88 


3216 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 5 


472 


82 


3217 


AF090930 


Homo sapiens 


PRO0478 


96 


64 


3218 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


137 


46 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








7253. 






3219 


AF006061 


Homo sapiens 


placental growth hormone isoform 
hGH-V3 


290 


68 


3220 


GOO j 65 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


1 15 


63 


3221 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


441 


71 


3222 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


111 


59 


3223 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


497 


81 


3224 


L26953 


Homo sapiens 


chromosomal protein 


95 


59 


3225 


U83280 


Leishmania 
donovani 


39 kDa antigen 


1 17 


90 


3226 


AF239615 


Manihot 
esculenta 


CRANTZ hydroxyproline-rich 
glycoprotein 


124 


43 


3227 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 4 


442 


74 


3228 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


172 


67 


3229 


X92591 


Mus musculus 


transcription factor 


109 


43 


3230 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


81 


54 


3231 


D38115 


Pongo pygmaeus 


cytochrome c oxidase subunit 1 


476 


72 


3232 


Y3 63 66 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


100 


54 


3233 


U96416 


Dennyus 
distinctus 
timjonesi 


cytochrome b 


95 


40 


3234 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


161 


86 


3235 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


101 


40 


3236 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


149 


75 


3237 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 5 


532 


76 


3238 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


576 


89 


3239 


U93567 


Homo sapiens 


p40 


155 


32 


3240 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


146 


79 


3241 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


145 


65 


3242 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


76 


62 


3243 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


138 


69 


3244 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


115 


39 


3245 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


138 


54 


3246 


G02211 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6292. 


153 


69 


3247 


U58736 


Caenorhabditis 
elegans 


Similar to cuticular collagen 


107 


37 


3248 


U87607 


Rattus 
norvegicus 


putative RNA binding protein 1 


148 


45 


3249 


S75895 


Homo sapiens 


NADH dehydrogenase subunit 2, ND2 


113 


85 


3250 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


166 


61 


3251 


U15183 


Mycobacterium 
leprae 


proline-rich antigen 


109 


40 


3252 


AF130089 


Homo sapiens 


PRO2550 


151 


73 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


3253 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


I3l 


58 


3254 


Y86248 


Homo sapiens 


Human secreted protein HCHPF6S, 
SEQ ID NO: 163. 


162 


80 


3255 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rha la. 


108 


57 


3256 


J0I415 


Homo sapiens 


cytochrome oxidase subunit 3 


1 56 


88 


3257 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


206 


93 


3258 


Y36203 


Homo sapiens 


Human secreted protein #75. 


111 


77 


3259 


W03642 


Homo sapiens 


Human cannabinoid GPR N-terminal 
sequence. 


98 


57 


3260 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


345 


67 


3261 


Y86248 

« 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


97 


56 


3262 


L38908 


Nicotiana 
tabacum 


extensin 


1 55 


40 


3263 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


95 


70 


3264 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


198 


87 


3265 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


1 15 


69 


3266 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


158 


59 


3267 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


105 


56 


3268 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


159 


72 


3269 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


200 


95 


3270 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


138 


73 


3271 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


151 


90 


3272 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


193 


95 


3273 


Y3 63 66 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


111 


52 


3274 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


148 


87 


3275 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


161 


86 


3276 


S62928 


Homo sapiens 


PRB1M protein precursor 


104 


34 


3277 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


154 


55 


3278 


G02755 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6836. 


166 


100 


3279 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


212 


91 


3280 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


118 


50 


3281 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
bJbQ ID NO:163. 


130 


46 


3282 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


303 


95 


3283 


D38112 


Homo sapiens 


cytochrome c oxidase subunit l 


455 


81 


3284 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


502 


83 


3285 


AF 118086 


Homo sapiens 


PR01992 


99 


88 


3286 


D38112 


Homo sapiens 


cytochrome c oxidase subunit l 


489 


83 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


3287 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


485 


88 


3288 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


124 


73 


3289 


AL3597S2 


Trypanosoma 
brucei 


possible (hhv-6) ull02, variant a dna, 
complete virion genome. 


158 


60 


3290 


X92485 


Plasmodium 
vivax 


pval 


99 


58 


3291 


AF1 16712 


Homo sapiens 


PRQ2738 


102 


51 


3292 


AF119851 


Homo sapiens 


PRO 1722 


99 


86 


3293 


AF090930 


Homo sapiens 


PRO0478 


141 


72 


3294 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


151 


66 


3295 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


107 


41 


3296 


AK024455 


Homo sapiens 


FLJ00047 protein 


97 


53 


3297 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


156 


60 


3298 


AC007654 


Arabidopsis 
thaliana 


T19E23.7 


97 


49 


3299 


Y17221 


Homo sapiens 


Human secreted protein (clone fk317- 
3)- 


96 


44 


3300 


AJ243905 


Caenorhabditis 
elegans 


SF1 protein 


104 


42 


3301 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


96 


41 


3302 


X89453 


Rattus 
norvegicus 


DRPLA 


83 


65 


3303 


U10099 


Homo sapiens 


POM-ZP3 


227 


52 


3304 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


553 


89 


3305 


L27428 


Homo sapiens 


reverse transcriptase 


144 


43 


3306 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


388 


71 


3307 


AF2 17374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


439 


78 


3308 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


439 


82 


3309 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


492 


88 


3310 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


81 


65 


3311 


X53581 


Rattus 
norvegicus 


ORF4 


140 


48 


3312 


V00662 


Homo sapiens 


cytochrome B 


487 


88 


3313 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


451 


80 


3314 


V00662 


Homo sapiens 


cytochrome oxidase I 


524 


88 


3315 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


518 


87 


3316 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


115 


50 


3317 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


356 


69 


3318 


U12693 


Homo sapiens 


cytochrome oxidase subunit II 


477 


78 


3319 


Z67990 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01484 (Nematode cuticle collagen 
N-terminal domain), Score=27.5, E- 
value=0.0001,N=l 


75 


40 


3320 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


126 


75 


3321 


AF090930 


Homo sapiens 


PRO0478 


166 


65 


3322 


X92485 


Plasmodium 
vivax 


pval 


104 


44 


3323 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


513 


80 


3324 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


483 


79 
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SEQ IB 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


3325 


AF130089 


Homo sapiens 


PRO2550 


121 


88 


3326 


AC026234 


Unknown 


Contains weak similarity to an 
unknown protein T23E1 8.5 


191 


66 


3327 


V00662 


Homo sapiens 


cytochrome B 


439 


72 


3328 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


316 


75 


3329 


AF091711 


Homo sapiens 


splice variant AKAP3 50 


106 


52 


3330 


U93565 


Homo sapiens 


putative pi 50 


125 


45 


3331 


AF1 19851 


Homo sapiens 


PR01722 


88 


72 


3332 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


489 


81 


3333 


V00662 


Homo sapiens 


cytochrome oxidase I 


544 


90 


3334 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


475 


80 


3335 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


168 


66 


3336 


AF146688 


Takifugu 
rubripes 


serine-threonine kinase 9 


101 


43 


3337 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


101 


50 


3338 


G00382 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4463. 


110 


50 


3339 


V00662 


Homo sapiens 


cytochrome oxidase I 


458 


80 


3340 


K02401 


Homo sapiens 


chorionic somatomammotropin 


209 


97 


3341 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


112 


52 


3342 


X65165 


Volvox carteri 


extensin 


146 


41 


3343 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


119 


51 


3344 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


559 


84 


3345 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


508 


89 


3346 


Ml 1897 


Mus museums 


proline-rich salivary protein 


96 


41 


3347 


U92455 > 


Mus musculus 


WW domain binding protein 7; WBP7 


113 


38 


3348 


U83280 


Leishmania 
donovani 


39 kDa antigen 


105 


40 


3349 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


113 


38 


3350 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


542 


79 


3351 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


664 


88 


3352 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


117 


85 


3353 


A18812 


Brassica napus 


extensin 


103 


31 


3354 


M10546 


Homo sapiens 


cytochrome oxidase I 


125 


78 


3355 


M81321 


Macaca 
fascicularis 


proline-rich protein 


107 


44 


3356 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


284 


90 


3357 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


513 


80 


3358 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


541 


79 


3359 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


543 


87 


3360 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


563 


86 


3361 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


179 


57 


3362 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


125 


62 


3363 


A T"*/"\ 1 AAA 

AF0 16099 


Mus musculus 


endonuclease/reverse transcriptase 


151 


46 


3364 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


569 


82 


3365 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


102 


40 


3366 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


143 


76 


3367 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490. 


145 


84 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


1 Smith- 
Waterman 
Score 


% 

Identit 

y 


3368 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


587 


87 


3369 


D381 12 


Homo sapiens 


cytochrome c oxidase subunit 1 


549 


82 


3370 


AL035526 


Arabidopsis 
thaliana 


extensin-like protein 


130 


39 


35 i 1 


JU4o4:> 


Homo sapiens 


synexm 


101 


40 


3372 


Z29573 


Didelphis 
virginiana 


cytochrome c oxidase subunit 3 


154 


71 


3373 


U93568 


Homo sapiens 


putative pi 50 


176 


59 


3374 


L28748 


Bos taurus 


putative 


146 


67 


3375 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


512 


75 


3376 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


509 


78 


3377 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


135 


51 


3378 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


557 


82 


3379 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


120 


35 


3380 


X92485 


Plasmodium 

vivax 


pval 


124 


38> 


3381 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


110 


78 


3382 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


108 


67 


3383 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


115 


86 


3384" 


X98296 


Homo sapiens 


ubiquitin hydrolase 


238 


83 


3385 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


494 


75 


3386 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


155 


66 


3387 


AL359782 


Trypanosoma 
brucei 


probable similar to ring~h2 finger 
protein rhala. 


116 


65 


3388 


W73624 


Homo sapiens 


Human secreted protein clone aw921. 


327 


62 


3389 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


141 


76 


3390 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


73 


61 


3391 


U35730 


Mus musculus 


jerky 


152 


30 


3392 


U54788 


Mus musculus 


Wiskott-Aldrich Syndrome Protein 


100 


41 


3393 


U97674 


Mesocricetus 
auratus 


cytochrome c oxidase chain I 


483 


77 


3394 


D38H3 


Pan troglodytes 


cytochrome c oxidase subunit 1 


539 


81 


3395 


D38H2 


Homo sapiens 


cytochrome c oxidase subunit 1 


608 


87 


3396 


X71602 


Nicotiana 
tabacum 


extensin 


104 


35 


3397 


L26953 


Homo sapiens 


chromosomal protein 


100 


60 


3398 


AF197832 


Cyanocitta 
cristata 


cytochrome oxidase I 


488 


76 


3399 


X53581 


Rattus 
norvegicus 


ORF4 


140 


42 


3400 


D38H2 


Homo sapiens 


cytochrome c oxidase subunit 1 


597 


85 


3401 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


656 


89 


3402 


L26953 , 


Homo sapiens 


chromosomal protein 


117 


63 




Ar vD22y& 


Drosophila 
silvestris 


Y box protein 


114 


30 


3404 


U43360 


Peromyscus 
maniculatus 


reverse transcriptase 


133 s 


53 


3405 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


151 


65 


3406 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


277 


82 


3407 


U01849 


Trypanosoma 


ORF2 


94 


30 
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SEQ 10 

NO: 


Accession No. 


Species 


Description 


Smith- 

Waterman 

Score 


% 

Identit 

y 






brucei 








3408 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


115 


65 


3409 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


189 


86 


3410 


U93570 


Homo sapiens 


putative pi 50 


233 


52 


3411 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


104 


44 


3412 


AF090944 


Homo sapiens 


PRO0663 


97 


56 


3413 


X61048 


Hydra sp. 


mini-collagen 


128 


44 


3414 


AF1 18086 


Homo sapiens 


PRO 1992 


128 


81 


3415 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


145 


61 


3416 


AF 116695 


Homo sapiens 


PR02221 


169 


69 


3417 


U31778 


Human 

papillomavirus 
type 20 


putative 


107 


50 


3418 


X92485 


Plasmodium 
vivax 


pval 


97 


54 


3419 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


195 


.94 


3420 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


144 


69 


3421 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


139 


71 


3422 


G02409 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6490. 


111 


63 


3423 


X15917 


Paramecium 
aurelia 


ORF4 protein (AA 1-156) 


93 


41 


3424 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


99 


69 


3425 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


115 


60 


3426 


Y14486 


Homo sapiens 


cytosolic serine 
hydroxymethyltransferase 


196 


61 


3427 


U93569 


Homo sapiens 


putative pi 50 


110 


44 


3428 


G03415 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7496. 


120 


67 


3429 


ABO 1 2223 


Canis familiaris 


ORF2 


169 


44 


3430 


X97675 


Homo sapiens 


plakophilin 2b 


118 


60 


3431 


AF220264 


Homo sapiens 


MOST-1 


153 


72 


3432 


X64173 


Zea 

diploperennis 


hydroxyproline-rich glycoprotein 


104 


41 


3433 


a v i 1 ■*> (Tor 

AFl I3685 


Homo sapiens 


PRO0974 


104 


72 


"> A ~> A 

34^4 


AB002306 


Homo sapiens 


TJTT A A AO Aft 

KIAA0308 


282 


83 


3435 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


1 13 


83 


^436 


ABO 12223 


Canis familiaris 


ORF2 


108 


43 


34 d 1 


AK024455 


Homo sapiens 


TpT TA AA A f~f j ■ 

FLJ00047 protein 


100 


65 


3438 


U52077 


Homo sapiens 


mariner transposase 


381 


64 


3439 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


425 


84 


3440 


AK024455 


Homo sapiens 


FLJ00047 protein 


123 


63 


3441 


G02624 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6705. 


90 


78 


3442 


Y36156 


Homo sapiens 


Human secreted protein #28. 


126 


71 


3443 


AF191032 


Myxine 
glutinosa 


RING3 


84 


63 


3444 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 
terminal fragment. 


141 


57 


3445 


Y08061 


Homo sapiens 


Human c-myb protein fragment. 


88 


65 
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3446 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


157 


67 


3447 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


281 


86 


3448 


A T~* 1 10/101 

AF1 12481 


Homo sapiens 


RAD54B protein 


392 


87 


3449 


G00497 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4578. 


98 


55 


3450 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


133 


90 


3451 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


418 


82 


3452 


AF 13 0079 


Homo sapiens 


PR02852 


110 


80 


3453 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


143 


73 


3454 


AB032254 


Homo sapiens 


bromodomain adjacent to zinc finger 
domain 2A 


447 


85 


3455 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


134 


62 


3456 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


79 


68 


3457 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


545 


84 


3458 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


134 


83 


3459 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


277 


80 


3460 


AK000867 


Homo sapiens 


unnamed protein product 


474 


98 


3461 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


123 


52 


3462 


AF041330 


Bodo saltans 


NADH dehydrogenase subunit 5 


89 


30 


3463 


U93568 


Homo sapiens 


putative pi 50 


100 


46 


3464 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


156 


43 


3465 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


340 


88 


3466 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


119 


100 


3467 


B01372 


Homo sapiens 


Neuron-associated protein. 


142 


75 


3468 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


103 


40 


3469 


G00357 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4438. 


139 


80 


3470 


W54282 


Homo sapiens 


Protein sequence of the di~alpha 
haemoglobin gene contained in pSSl . 


207 


89 


3471 


AF116661 


Homo sapiens 


PR01438 


119 


47 


3472 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


129 


59 


3473 


AF130089 


Homo sapiens 


PRO2550 


108 


86 


3474 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


1 13 


80 


'y An c 
3475 


V 00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


590 


79 


3476 


AF 118086 


Homo sapiens 


PRO 1992 


126 


69 


3477 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


147 


47 


3478 


J01415 


Homo sapiens 


MTND4 


482 


78 


3479 


X03404 


Bos taurus 


alpha subunit (aa 1-394) 


583 


89 


3480 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


514 


84 


3481 


G03628 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7709. 


242 


80 


3482 


G00407 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4488. 


121 


61 


3483 


X83427 


Ornithorhynchus 
anatinus 


cytochrome c oxidase subunit 1 


508 


78 


3484 


Y07754 


Homo sapiens 


Human secreted protein fragment 


549 


93 
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y 








encoded from gene 1 1 . 






3485 


X97567 


Homo sapiens 


porl 


388 


69 


3486 


X97567 


Homo sapiens 


porl 


608 


84 


3487 


K02401 


Homo sapiens 


chorionic somatomammotropin 


641 


93 


3488 


AFO 10144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


135 


71 


( 3489 


S62941 


Homo sapiens 


Ps 2=basic proline-rich protein(PRBlL 
precursor protein=basic proline-rich 
proteins (Ps ? PmF, PmS, and Pe) 
precursor) {C-terminal} 


116 


36 


3490 


S74728 


Homo sapiens 


antiquitin=26g turgor protein homolog 


549 


84 


3491 


L13329 


Homo sapiens 


iduronate-2-sulfatase 


564 


85 


3492 


X79535 


Homo sapiens 


beta tubulin 


620 


88 


3493 


AF08I484 


Homo sapiens 


alpha-tubulin isoform 1 


578 


87 


3494 


U09823 


Oryctolagus 
cuniculus 


elongation factor 1 alpha 


631 


89 


3495 


AF081484 


Homo sapiens 


alpha-tubulin isoform 1 


616 


90 


3496 


M12140 


Homo sapiens 


envelope protein 


430 


53 


3497 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


123 


70 


3498 


W48352 


Homo sapiens 


Human breast cancer related protein 
BCFLT1. 


104 


51 


3499 


R13556 


Homo sapiens 


Protein encoded downstream of hlicJVI 
oncoprotein. 


134 


54 


3500 


Y36156 


Homo sapiens 


Human secreted protein #28. 


143 


75 


3501 


AF1 13685 


Homo sapiens 


PRO0974 


77 


81 


3502 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


157 


62 


3503 


AF090944 


Homo sapiens 


PRO0663 


164 


81 


3504 


AF090931 


Homo sapiens 


PRO0483 


103 


70 


3505 


U93570 


Homo sapiens 


p40 


258 


47 


3506 


AB016601 


Drosophila 
alpina 


cytochrome c oxidase subunit I 


108 


82 


3507 


ABO 16601 


Drosophila 
alpina 


cytochrome c oxidase subunit I 


108 


82 


3508 


AF194537 


Homo sapiens 


NAG 13 


449 


78 


3509 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


153 


60 


3510 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


102 


37 


3511 


AC008113 


Arabidopsis 
thaliana 


F12A21.10 


90 


48 


3512 


M64791 


Rattus 
norvegicus 


salivary proline-rich protein 


104 


33 


3513 


AF090944 


Homo sapiens 


PRO0663 


115 


52 - 


3514 


AF240630 


Mus musculus 


IQ motif containing GTPase activating 
protein 1 


152 


67 


3515 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


123 


48 


3516 


AF090942 


Homo sapiens 


FRO0657 


128 


47 


3517 


Ml 0546 


Homo sapiens 


cytochrome oxidase I 


286 


56 


3518 


AF255661 


Crypthecodinium 
cohnii 


Dinapl -interacting protein 5; Dip5 


108 


36 


3519 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


209 


93 


3520 


M11901 


Rattus 
norvegicus 


proline-rich salivary protein 


102 


38 


3521 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


131 


64 


3522 


AL359782 


Trypanosoma 


possible (hhv-6) ul 102, variant a dna, 


105 


42 
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% 
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y 






brucei 


complete virion genome. 






3523 


R95913 


Homo sapiens 


Neural thread protein. 


118 


72 


3524 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:l63. 


115 


59 


3525 


L27428 


Homo sapiens 


reverse transcriptase 


162 


50 


3526 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


138 


58 


3527 


AF090930 


Homo sapiens 


PRO0478 


146 


66 


3528 


G00528 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4609. 


169 


50 


3529 


AF 194537 


Homo sapiens 


NAG 13 


119 


88 


3530 


U93564 


Homo sapiens 


putative pi 50 


234 


84 


3531 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


208 


63 


3532 


U 12690 


Homo sapiens 


cytochrome oxidase subunit II 


275 


92 


3533 


AF090931 


Homo sapiens 


PRO0483 


126 


66 


3534 


W05278 


Homo sapiens 


Tumour necrosis factor-related gene 
product CL6.5.40. 


117 


85 


3535 


G03076 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7157. 


173 


77 


3536 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


117 


45 


3537 


AF209061 


Eubranchipus sp. 


cytochrome c oxidase I 


127 


80 


3538 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 3 


174 


85 


3539 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


118 


45 


3540 


R13556 


Homo sapiens 


Protein encoded downstream of hhcjvl 
oncoprotein. 


88 


50 


3541 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


128 


71 


3542 


U74612 


Homo sapiens 


hepatocyte nuclear factor-3/fork head 
homolog 11A 


111 


77 


3543 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


149 


69 


3544 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


470 


80 


3545 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


120 


54 


3546 


AL359782 


Trypanosoma 
brucei 


probable similar to ring-h2 finger 
protein rhala. 


106 


42 


3547 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


584 


83 


3548 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


635 


95 


3549 


D38H2 


Homo sapiens 


cytochrome c oxidase subunit 3 


661 


91 


3550 


D38H2 


Homo sapiens 


cytochrome c oxidase subunit 1 


607 


86 


3551 


AK024455 


Homo sapiens 


FLJ00047 protein 


88 


51 


3552 


W88957 


Homo sapiens 


Polypeptide fragment encoded by gene 
128. 


500 


86 


3553 


D38H3 


Pan troglodytes 


NADH dehydrogenase subunit 4 


228 


95 


3554 


AF130092 


Homo sapiens 


PRO2620 


165 


93 


3555 


AL121845 


Homo sapiens 


dJ583P15.5.1 (novel protein (isoform 
1)) 


659 


91 


3556 


G03062 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7143. 


170 


72 


n ceo 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


599 


88 


3558 


D38H2 


Homo sapiens 


cytochrome c oxidase subunit 3 


578 


84 


3559 


AK000385 


Homo sapiens 


unnamed protein product 


138 


47 


3560 


Y02886 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 90. 


112 


61 


3561 


X97675 


Homo sapiens 


plakophilin 2b 


123 


65 


3562 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


94 


62 
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% 
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y 








6595. 






3563 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


198 


90 


3564 


X86029 


Vigna 
unguiculata 


extens in-like protein 


141 


36 


3565 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


77 


60 


3566 


AP000616 


Oryza sativa 


similar to RING-H2 ringer protein 
RHAla(AF078683) 


1 10 


94 


3567 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


203 


93 


3568 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


146 


72 


3569 


S62928 


Homo sapiens 


PRB1M protein precursor 


102 


44 


3570 


D86853 


Catharanthus 
roseus 


extensin 


123 


40 


3571 


U93574 


Homo sapiens 


putative p 1 50 


279 


56 


3572 


Y02785 


Homo sapiens 


Human secreted protein encoded by 
gene 51 clone HUKEX85. 


94 


82 


3573 


U45964 


Herpesvirus 
papio 


LMP1 


127 


35 


3574 


Y08319 


Homo sapiens 


kinesin-2 


254 


55 


3575 


X97675 


Homo sapiens 


plakophilin 2b 


134 


54 


3576 


AF003151 


Caenorhabditis 
elegans 


contains similarity to an RJMA 
recognition motif 


137 


43 


3577 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


126 


66 


3578 


U80447 


Caenorhabditis 
elegans 


coded for by C. elegans cDNA 
ykl87f6.5; coded for by C. elegans 
cDNA ykl 87f6.3; coded for by C. 
elegans cDNA ykl46f 1 1 .5 


117 


50 


3579 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24JrIUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIR:JH0162) 


126 


31 


3580 


AJ249395 


Globodera 
pallida 


■VTA T\T T 1 * • * J 1 r 

NADH-ubiqumone oxidoreductase 
subunit 4 


85 


33 


3581 


U091 16 


Homo sapiens 


ORF1, encodes a 40 kDa product 


139 


55 


3582 


X97675 


Homo sapiens 


plakophilin 2b 


125 


73 


3583 


L26953 


Homo sapiens 


chromosomal protein 


101 


55 


3584 


S80343 


Homo sapiens 


arginyl-tRNA synthetase, ArgRS 


110 


67 


3585 


X97675 


Homo sapiens 


plakophilin 2b 


116 


84 


3586 


X92485 


Plasmodium 
vivax 


pval 


124 


50 


3587 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


126 


56 


3588 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


136 


56 


3589 


U63542 


Homo sapiens 


FAP protein 


128 


75 


3590 


D38112- 


Homo sapiens 


cytochrome c oxidase subunit 3 


642 


88 


3591 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 1 


612 


87 


3592 


K02401 


Homo sapiens 


chorionic somatomammotropin 


576 


86 


3593 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


108 


40 


3594 


V00662 


Homo sapiens 


cytochrome oxidase I 


459 


76 


3595 


AC008262 


Arabidopsis 
thaliana 


F4N2.10 


136 


31 


3596 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


126 


52 
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3597 


AF 116645 


Homo sapiens 


PRO 1708 


623 


91 


3598 


M23028. 


Human 
herpesvirus 4 


nuclear antigen precursor 


121 


35 


3599 


W88816 


Homo sapiens 


Polypeptide fragment encoded by gene 
58. 


100 


33 


3600 


X92485 


Plasmodium 
vivax 


pval 


156 


36 


3601 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

a 

/ IZ4-. 


122 


78 


3602 


AK000496 


Homo sapiens 


unnamed protein product 


150 


68 


3603 


AF083929 


Mus museums 


ES18 


98 


40 


3604 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


537 


83 


3605 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


548 


85 


3606 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


158 


57 


3607 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


429 


75 


3608 


G03356 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7437. 


122 


65 


3609 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


88 


3610 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 5 


224 


80 


3611 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


481 


70 


3612 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


166 


92 


3613 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


582 


83 


3614 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


167 


91 


3615 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


582 


77 


3616 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) and D. 
melanogaster No-on-transient A protein 
(PIR:JH0162) 


101 


40 


3617 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


591 


85 


3618 


AB037275 


Cynomolgus 
Epstein-Barr 
Virus TsB-B6 


EBNA-1 


119 


42 


3619 


AF061944 


Homo sapiens 


kinase deficient protein KDP 


581 


91 


3620 


Y34068 


Homo sapiens 


Histone HI isoform HI .5 pANCA- 
reactive fragment (residues 69-226). 


100 


33 


3621 


AL049608 


Arabidopsis 
thaliana 


extensin-like protein 


105 


27 


3622 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla(AF078683) 


127 


69 


3623 


S71569 


Neocallimastix 
patriciarum, 
Peptide, 860 aa 


Xylanase B, XYLB {EC 3.2.1.8} 
• 


108 


43 


3624 


U23172 


Caenorhabditis 
elegans 


similar to myoblast cell surface antigen 
(SP:CS24_HUMAN, P23246) andD. 
melanogaster No-on-transient A protein 
(PIR:JH0162) 


101 


37 


3625 


t^vo n 1 1 a 

D38H2 


Homo sapiens 


NADH dehydrogenase subunit 5 


390 


83 


3626 


D38H2 


Homo sapiens 


cytochrome c oxidase subunit 1 


582 


84 


<-> /Ton 

3627 


DJol 12 


Homo sapiens 


cytochrome c oxidase subunit 3 


637 


C\f\ 

90 


3628 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


169 


39 


3629 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


142 


77 


3630 


U93564 


Homo sapiens 


putative pi 50 


330 


90 


3631 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


148 


42 
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SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


3632 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


100 


80 


3633 


G02360 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6441. 


139 


65 


3634 


U74071 


Phascolosoma 
sp. 'California' 


cytochrome c oxidase subunit I 


248 


72 


3635 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


121 


87 


3636 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


128 


65 


3637 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


124 


63 


3638 


Y08062 


Homo sapiens 


Human PR0245 protein fragment 
derived from DNA3563S. 


159 


51 


3639 


X65582 


Mus musculus 


alpha-2 collagen 


102 


53 


3640 


AF090942 


Homo sapiens 


PRO0657 


97 


76 


3641 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


146 


69 


3642 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


118 


55 


3643 


X14576 


Murine leukemia 
virus 


gagpl5 protein 


134 


44 


3644 


AF130051 


Homo sapiens 


PRO0898 


177 


48 


3646 


G03064 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7145. 


120 


59 


3647 


AK000385 


Homo sapiens 


unnamed protein product 


105 


40 


3648 


V00662 


Homo sapiens 


cytochrome oxidase I 


533 


83 


3649 


AFO 90944 


Homo sapiens 


PRO0663 


129 


73 


3650 


AF090944 


Homo sapiens 


?RO0663 


129 


73 


3651 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


88 


60 


3652 


AF090930 


Homo sapiens 


PRO0478 


92 


57 


3653 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


148 


73 


3654 


ABO 18440 


Echinococcus 
multilocularis 


NADH dehydrogenase subunit 2 


100 


32 


3655 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


165 


68 


3656 


AF161356 


Homo sapiens 


HSPC093 


106 


48 


3657 


AF090930 


Homo sapiens 


PRO0478 


158 


85 


3658 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


128 


82 


3659 


U93574 


Homo sapiens 


putative pi 50 


117 


44 


3660 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


149 


51 


3661 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


67 


70 


3662 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


99 


62 


3663 


AL451017 


Neurospora 
crassa 


related to Ul SMALL NUCLEAR 
RIBONUCLEOPROTEIN C 


154 


45 


3664 


Y27571 


Homo sapiens 


Human secreted protein encoded by 
gene No. 5. 


136 


77 


3665 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


128 


59 


3666 


U47855 


Araneus 
diadematus 


fibroin-3 


109 


39 


3667 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


604 


88 


3668 


K02401 


Homo sapiens 


chorionic somatomammotropin 


636 


93 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


3669 


AF083929 


Mus museums 


ESI 8 


133 


46 


3670 


J01415 


Homo sapiens 


MTND4 


620 


87 


3671 


AK025047 


Homo sapiens 


unnamed protein product 


160 


60 


3672 


G03793 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7874. 


106 


31 


3673 


ABO 1903 8 


Homo sapiens 


beta- 1 ,4 mannosy transferase 


766 


94 


3674 


U74612 


Homo sapiens 


hepatocyte nuclear factor-3/fork head 
homolog HA 


113 


69 


3675 


U80447 


Caenorhabditis 
elegans 


coded for by C. elegans cDNA 
ykl87f6.5; coded for by C. elegans 
cDNA yk!87f6.3; coded for by C. 
elegans cDNA ykl46fl L5 


113 


39 


3676 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


197 


68 


3677 


U52077 


Homo sapiens 


mariner transposase 


500 


74 


3678 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63, 


174 


97 


3679 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 1 63. 


191 


86 


3680 


AC003113 


Arabidopsis 
thaliana 


F24Q1.6 


102 


83 * 


3681 


AF005370 


Alcelaphine 
herpesvirus I 


putative immediate early protein 


153 


42 


3682 


GO 1502 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5583. 


111 


62 


3683 


AP002543 


Arabidopsis 
thaliana 


gb|AAD230 1 5. l~geneJd:Fl5M7. 16~si 
milar to unknown protein 


131 


40 


3684 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


146 


64 


3685 


U93572 


Homo sapiens 


putative pi 50 


158 


43 


3686 


Y34068 


Homo sapiens 


Histone HI isoform HI. 5 pANCA- 
reactive fragment (residues 69-226). 


98 


43 


3687 


W80406 


Homo sapiens 


A secreted protein encoded by clone 
dh40_3. 


106 


54 


3688 


L26953 


Homo sapiens 


chromosomal protein 


128 


76 


3689 


D38112 


Homo sapiens 


cytochrome c oxidase subunit l 


563 


87 


3690 


R96800 


Homo sapiens 


Human histiocyte-sscreted factor HSF. 


109 


67 


3691 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:l63. 


122 


75 


3692 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


165 


85 


3693 


M20789 


Homo sapiens 


alpha- 1 type I collagen 


141 


43 


3694 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


162 


50 


3695 


AF210651 


Homo sapiens 


NAG 18 


124 


67 


3696 


AF130079 


Homo sapiens 


PR02852 


94 


53 


3697 


Y19192 


Talpa europaea 


cytochrome oxidase subunit I 


618 


85 


3698 


D38H2 


Homo sapiens 


NADH dehydrogenase subunit 2 


414 


86 


3699 


D38H2 


Homo sapiens 


cytochrome c oxidase subunit 1 


487 


76 


3700 


G02455 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6536. 


109 


42 


3701 


AF217374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


563 


85 


3702 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


106 


48 


3703 


D38H2 


Homo sapiens 


cytochrome c oxidase subunit 1 


522 


79 


3704 


D67066 


Bos taurus 


N-WASP 


219 


42 


3705 


Y36366 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 3. 


100 


51 


3706 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


115 


44 



226 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 








7879. 






3707 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


105 


37 


3708 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


125 


74 


3709 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


558 


88 


3710 


AF217374 


Acanthaster. 
planci 


cytochrome oxidase subunit I 


512 


85 


3711 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


186 


80 


3712 


AF197854 


Melithreptus 
hinatus 


cytochrome oxidase I 


189 


83 


3713 


AP000616 


Oryza sativa 


similar to RING -J 12 finger protein 
RHAla(AF078683) 


129 


95 


3714 


AB029027 


Homo sapiens 


KIAA1 104 protein 


141 


81 


3715 


L26251 


Trypanosoma 
brucei 


CR5 


73 


31 


3716 


R95913 


Homo sapiens 


Neural thread protein. 


117 


44 


3717 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


95 


67 


3718 


Y36112 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


108 


81 


3719 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct. 


132 


55 


3720 


M64923 


Bos taurus 


C 1 0 protein 


194 


94 


3721 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


115 


75 


3722 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


1 18 


95 


3723 


AF302773 


Homo sapiens 


ninein-Lm isoform 


157 


57 


3724 


AK024455 


Homo sapiens 


FLJ00047 protein 


134 


59 


3725 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


161 


80 


3726 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


121 


52 


3727 


AF220264 


Homo sapiens 


MOST-1 


86 


88 


3728 


L27428 


Homo sapiens 


reverse transcriptase 


316 


62 


3729 


U83303 


Homo sapiens 


line-1 reverse transcriptase 


112 


46 


3730 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


145 


68 


3731 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


118 


56 


3732 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


131 


65 


3733 


AF090930 


Homo sapiens 


PRO0478 


148 


71 


3734 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


140 


72 


3735 


U93569 


Homo sapiens 


putative pi 50 


270 


59 


3736 


U93568 


Homo sapiens 


putative pi 50 


151 


38 


3737 


G02493 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6574. 


124 


82 


3738 


X92485 


Plasmodium 
vivax 


pval 


101 


44 


3739 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

Ha 1 n 


146 


51 


3740 


AF090944 


Homo sapiens 


PRO0663 


124 


56 


3741 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


94 


47 


3742 


AF1 18086 


Homo sapiens 


PRO 1992 


124 


61 


3743 


AF083929 


Mus musculus 


ESI 8 


108 


35 


3744 


L34685 


Arabidopsis 


cell wall protein 


120 


34 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 






thaliana 








3745 


AF090931 


Homo sapiens 


PRO0483 


117 


54 


3746 


AF090930 


Homo sapiens 


PRO0478 


136 


75 


3747 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


154 


59 


3748 


G00368 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4449. 


92 


44 


3749 


AP000616 


Oryza sativa 


similar to RJNG-H2 finger protein 
RHAla(AF078683) 


98 


42 


3750 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


165 


73 


3751 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


113 


87 


3752 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


96 


58 


3753 


U93574 


Homo sapiens 


putative pi 50 


157 


60 


3754 


X56015 


Crithidia 
oncopelti 


NADH dehydrogenase subunit 5 


103 


30 


3755 


M80613 


Homo sapiens 


putative 


114 


26 


3756 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


119 


54 


3757 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


210 


89 


3758 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


91 


58 


3759 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


110 


46 


3760 


M13100 


Rattus 
norvegicus 


unknown protein 


119 


46 


3761 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


135 


52 


3762 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


132 


84 


3763 


Ml 1900 


Mus musculus 


15-kDa proline-rich salivary protein 


125 


40 


3764 


U63542 


Homo sapiens 


FAP protein 


131 


72 


3765 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


143 


76 


3766 


X92485 


Plasmodium 
vivax 


pval 


119 


54 


3767 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


81 


56 


3768 


G03411 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7492. 


98 


65 


3769 


AB005216 


Homo sapiens 


Nek, Ash and phospholipase C gamma- 
binding protein NAP4 


487 


90 


3770 


AF026211 


Caenorhabditis 
elegans 


Similar to cuticular collagen 


102 


37 


3771 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


91 


55 


3772 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6566. 


128 


76 


3773 


AK024455 


Homo sapiens 


FLJ00047 protein 


138 


66 


3774 


AF090895 


Homo sapiens 


PRO0117 


152 


60 


3775 


AF130051 


Homo sapiens 


PRO0898 


122 


78 


3776 


G01246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


100 


63 


3777 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


187 


75 


3778 


G00442 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4523. 


130 


56 
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SEQ ID 


Accession No. 


Species 


Description 


Smith- 


% 


NO: 








Waterman 


Identit 










Score 


y 


3779 


X55685 


Lycopersicon 


extensin (class I) 


132 


38 






esculentum 








3780 


L27428 


Homo sapiens 


reverse transcriptase 


116 


53 


3781 


Y71066 


Homo sapiens 


Human membrane transport protein, 


188 


86 








MTRP-11. 






3782 


AL390114 


Leishmania 


extremely cysteine/valine rich protein 


122 


67 






major 








3783 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 


462 


75 








similar to pogo element 






3784 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


67 








4414. 






3785 


Y67470 


Homo sapiens 


Np70 protein carboxy terminal region. 


129 


36 


3786 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


152 


78 


3787 


S80119 


Rattus sp. 


reverse transcriptase homolog 


143 


40 


3788 


AF090942 


Homo sapiens 


PRO0657 


134 


67 


3789 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


135 


63 








7795. 






3790 


AF1 16661 


Homo sapiens 


PR01438 


128 


45 


3791 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


130 


38 


3792 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


74 








4484. 






3793 


W50192 


Homo sapiens 


Amino acid sequence of salivary 


101 


48 








protein CON-l. 






3794 


AF000298 


Caenorhabditis 


weak similarity to collagens; glycine- 


175 


46 






elegans 


and proline-rich 






3795 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


117 


40 


3796 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


185 


88 








SEQ ID NO: 163. 






3797 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


610 


84 


3798 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


156 


56 








6619. 






3799 


AL359782 


Trypanosoma . 


probable similar to ring-h2 finger 


71 


52 






brucei 


protein rhala. 






3800 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


127 


77 








8172. 






3801 


AF130079 


Homo sapiens 


PR02852 


164 


54 


3802 


AL359782 


Trypanosoma 


probable similar to ring-h2 finger 


118 


45 






brucei 


protein rhala. 






3803 


U42580 


Paramecium 


Pro-rich, PAPK (20X); similar to 


101 


36 






burs aria 


Arabidopsis anter-speciflc Pro-rich 










Chlorella virus 1 


protein, corresponds to Swiss-Prot 












Accession Number P40602 






3804 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


173 


87 








SEQ ID NO: 163. 






3805 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


72 








7253. 






3806 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


225 


85 


3807 


U88587 


Nicotiana alata 


120 kDa style glycoprotein 


118 


38 


3808 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


129 


82 


3809 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


122 


67 








7124. 






3810 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 


213 


97 








SEQ ID NO: 163. 






3811 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


106 


63 








7124. 






3812 


AF169974 


Homo sapiens 


serine racemase 


153 


56 


3813 


L27428 


Homo sapiens 


reverse transcriptase 


175 


43 


3814 


W50193 


Homo sapiens 


Amino acid sequence of salivary 


109 


43 








protein CON-2. 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Idcntit 

y 


3815 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


176 


78 


3816 


G00352 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4433. 


1 12 


65 


3817 


AF1 19901 


Homo sapiens 


PR02S31 


1 13 


55 


3818 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


99 


89 


3819 


AL160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


91 


66 


3820 


X92485 


Plasmodium 

vivax 


pval 


108 


46 


3821 


AK024455 


Homo sapiens 


FLJ00047 protein 


79 


59 


3822 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 10L 


178 


87 


3823 


Y86472 


Homo sapiens 


Human gene 52-encoded protein 
fragment, SEQ ID NO:387. 


99 


50 


3824 


W21581 


Homo sapiens 


Alzheimer's disease protein encoded by 
DNA from plasmid pGCSl 1 037. 


86 


94 


3825 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
47 18. 


138 


53 


3826 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


129 


88 


3827 


G02987 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7068. 


77 


61 


3828 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


139 


51 


3829 


K03208 


Homo sapiens 


salivary proline-rich protein precursor 


161 


41 


3830 


U44838 


Glycine max 


extensin 


108 


47 


3831 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


136 


71 


3832 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


205 


84 


3833 


AL163912 


Arabidopsis 
thaliana 


glycine-rich protein atGRP-7 


117 


37 


3834 


AF130089 


Homo sapiens 


PRO2550 


138 


45 


3835 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


92 


76 


3836 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


152 


70 


3837 


X56010 


Sorghum bicolor 


hydroxyproline-rich glycoprotein 


98 


37 


3838 


X6S249 


Xenopus laevis 


Proline rich protein 


92 


66 


3839 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


262 


92 


3840 


R63235 


Homo sapiens 


CNS neural thread protein HB4. 


186 


100 


3841 


G04091 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8172. 


103 


64 


3842 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


116 


62 


3843 


AF164614 


Homo sapiens 


envelope protein 


508 


77 


3844 


AF130051 


Homo sapiens 


PRO0898 


130 


73 


3845 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


144 


74 


3846 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

A A C\T 

4497. 


119 


68 


3847 


M13100 


Rattus 
norvegicus 


unknown protein 


129 


46 


3848 


X97675 


Homo sapiens 


plakophilin 2b 


145 


67 


3849 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


141 


65 


3850 


K03206 


Homo sapiens 


salivary proline-rich protein precursor 


114 


42 
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Smith- 
Waterman 
Score 


% 
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y 


3851 


U93574 


Homo sapiens 


putative pi 50 


165 


82 


3852 


AF090931 


Homo sapiens 


PRO0483 


97 


78 


3853 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


279 


80 


3854 


U93570 


Homo sapiens 


putative pi 50 


135 


38 


3855 


M13100 


Rattus 
norvegicus 


unknown protein 


100 


57 


3856 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


166 


63 


3857 


Y48576 


Homo sapiens 


Human breast tumour-associated 
protein 37. 


117 


49 


3858 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


147 


45 


3859 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


150 


71 


3860 


G02994 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7075. 


165 


69 


3861 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


131 


83 


3862 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


171 


81 


3863 


AK000241 


Homo sapiens 


unnamed protein product 


100 


48 


3864 


Y36112 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


121 


80 


3865 


AK024455 


Homo sapiens 


FLJ00047 protein 


124 


69 


3866 


G02832 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6913. 


87 


64 


3867 


AF1 16712 


Homo sapiens 


PR02738 


92 


48 


386S 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


165 


62 


3869 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


128 


62 


3870 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


94 


75 


3871 


AK024455 


Homo sapiens 


FLJ00047 protein 


83 


72 


3872 


AJ242540 


Volvox carteri f. 
nagariensis 


hydroxyproline-rich glycoprotein DZ- 
HRGP 


147 


44 


3873 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


113 


51 


3874 


AF090930 


Homo sapiens 


PRO0478 


120 


63 


3875 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


145 


70 


3876 


D88674 


Homo sapiens 


antizyme inhibitor 


147 


75 


3877 


AF263744 


Homo sapiens 


erbb2-interacting protein ERBIN 


212 


37 


3878 


Y36112 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 497. 


140 


82 


3879 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


82 


35 


3880 


Y01181 


Homo sapiens 


Polypeptide fragment encoded by gene 
12. 


144 


60 


3881 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


122 


60 


3882 


G00577 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4658. 


134 


75 


3883 


M81321 


Macaca 
fascicularis 


proline-rich protein 


101 


41 


3884 


G03263 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7344. 


109 


56 


3885 


AF013214 


Bos taurus 


acidic ribosomal phosphoprotein PO 


177 


94 • 


3886 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


77 
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Waterman 

Score 


% 

Identit 

y 








4484. 






3887 


Y59860 


Homo sapiens 


Human normal uterus tissue derived 
protein 23. 


137 


72 


3888 


Y14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


147 


72 


3889 


M64792 


Rattus b 
norvegicus 


salivary proline-rich protein 


104 


40 


3890 


AF 156225 


Drosophila 
melanogaster 


salivary gland secretion protein 


104 


35 


3891 


"V/ry 1 i c O 

Y01 158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HO AC Jo 1 . 


90 


47 




YU14UU 


Homo sapiens 


Secreted protein encoded by gene 18 
clone HNHF029. 


1 O O 

IDS 


38 


3893 


D3 8112 


Homo sapiens 


NADH dehydrogenase subunit 5 


146 


65 


3894 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
787 1 . 


122 


34 


3895 


X02873 


Daucus carota 


put. precursor 


104 


48 


3896 


AF090942 


Homo sapiens 


PRO0657 


135 


50 


3897 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


195 


77 


3898 


X92485 


. Plasmodium 
vivax 


pval 


111 


62 


3899 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


107 


70 


3900 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


94 


93 


3901 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


163 


83 


3902 


AF090942 


Homo sapiens 


TVT> t~\t\ £Z C~l 

PRO0657 


134 


65 


3903 


AF220264 


Homo sapiens 


MOST-1 


74 


59 


3904 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


131 


53 


3905 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


129 


85 


3906 


AK025047 


Homo sapiens 


unnamed protein product 


127 


73 


3907 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


135 


86 


3908 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

ATX Q 


134 


72 


3909 


X77816 


Rattus 
norvegicus 


PR-Vbetal 


104 


62 


3910 


G02654 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

O ID J. 


114 


70 


3911 


AK024455 


Homo sapiens 


FLJ00047 protein 


135 


61 


3912 


U9:>567 


Homo sapiens 


putative pi 50 


233 


63 


3913 


W54966 


Homo sapiens 


Synthetic human type III collagen 
SYN-C3. 


96 


40 


3914 


X94976 


Brassica napus 


cell wall-plasma membrane linker 
protein 


104 


36 


3915 


P92219 


Homo sapiens 
(human) 


CR1 protein. 


125 


73 


3916 


'Vr\'~> .HOC 

X92485 


Plasmodium 
vivax 


pval 


94 


64 


3917 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


105 


52 


3918 


AF1 18086 


Homo sapiens 


PRO 1992 


130 


87 


3919 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


125 


80 


3920 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


162 


79 
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Accession No. 


Species 


Description 


Smith- 
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Score 


% 

Identit 

y 








4493. 






3921 


AF090930 


Homo sapiens 


PRO0478 


128 


64 


3922 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


114 


75 


3923 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


130 


65 


3924 


Y87064 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 103. 


134 


71 


3925 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


102 


58 


3926 


D63487 


Homo sapiens 


The KIAA0153 gene product is related 
to a putative C.elegans gene encoded in 
cosmid F42A8. 


141 


100 


3927 


U21123 


Drosophila 
melanogaster 


ena polypeptide 


117 


37 


3928 


X58438 


Mus museums 


proline rich protein 


136 


36 


3929 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


152 


71 


3930 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718, 


132 


73 


3931 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


86 


48 


3932 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


110 


65 


3933 


X92485 


Plasmodium 
vivax 


pval 


116 


36 


3934 


X97675 


Homo sapiens 


plakophilin 2b 


129 


82 


3935 


G03258 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7339. 


84 


58 


3937 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
2>EQ ID JNO: 16 j. 


107 


56 


j93o 


A T7AHA One 

AF090&95 


Homo sapiens 


JrROO 117 


166 


68 


3939 


K02550 


Oncorhynchus 
mykiss 


70-kilodalton heat shock protein 


104 


38 


3940 


AK024455 


Homo sapiens 


FLj 00047 protein 


1 12 


74 


3941 


L26953 


Homo sapiens 


chromosomal protein 


92 


67 


3942 


U63542 


Homo sapiens 


FAP protein 


135 


75 


3943 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


139 


80 


3944 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


141 


67 


3945 


AK024455 


Homo sapiens 


FLJ00047 protein 


123 


76 


3946 


Y08769 


Rattus 
norvegicus 


microvascular endothelial 
differentiation gene 2 


92 


52 


3947 


W48927 


Homo sapiens 


Schwannomin-binding protein C- 
terminal fragment. 


103 


56 


3948 


AF252293 


Homo sapiens 


PARS 


161 


44 


3949 


Y02749 


Homo sapiens 


Human secreted protein encoded by 
gene 100 clone HNFIU96. 


125 


47 


3950 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


59 


one i 
J951 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8148. 


174 


87 


jyoZ 


VI 7/1 QO CI 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


1 CO 

152 


75 


3953 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


168 


64 


3954 


R95913 


Homo sapiens 


Neural thread protein. 


110 


54 


3955 


AF130089 


Homo sapiens 


PRO2550 


122 


63 


3956 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


119 


48 
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Smith- 
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Score 


% 

Identit 

y 


3957 


Y36203 


Homo sapiens 


Human secreted protein #75. 


121 


75 


3958 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


177 


61 


d959 


aao o o 

u00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


131 


69 


j960 


U93563 


Homo sapiens 


putative pi 50 


227 


62 


3961 


D86853 


Catharanthus 
roseus 


extensin 


100 


40 


3962 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


101 


41 


3963 


GO 1828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5909. 


107 


80 


3964 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


121 


42 


3965 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


150 


66 


3966 


a a i i aa i 

Ml 1901 


Rattus 
norvegicus 


proline-rich salivary protein 


101 


34 


3967 


AF229126 


Homo sapiens 


acetylcholinesterase collagen-like tail 
subunit isofbrm VII 


108 


36 


OA/CO 


A 1 r\r\/ZT~7 Pi 

AJUUo / /U 


Cicer anetinum 


extensin 


86 


29 


3969 


Aril AnOft 

Ar 1301)59 


Homo sapiens 


FKU2550 


128 


82 


j970 


U9356J 


Homo sapiens 


putative pi 50 


99 


48 


3971 


AK000496 


Homo sapiens 


unnamed protein product 


134 


71 


3972 


Z70684 


Caenorhabditis 
elegans 


F28D1.8 


108 


42 


3973 


A T? A 1 A 1 A A 

At 0 1 0 1 44 


Homo sapiens 


neuronal thread protein AD7c-NTP 


137 


72 


3974 


AK024455 


Homo sapiens 


>LJ00047 protem 


132 


65 


3975 


AK024455 


Homo sapiens 


FLJ00047 protein 


128 


85 


3976 


X01918 


Drosophila 
melanogaster 


salivary gland glue protein 


99 


40 


O C\T~I 

3977 


(_r02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


136 


46 


1 0*70 

3978 


A "COO A A 

At 1 229067 


Homo sapiens 


PADI-H protein 


107 


87 


3979 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


189 


48 


3980 


S80119 


Rattus sp. 


reverse transcriptase homolog 


134 


49 


3981 


AF090901 


Homo sapiens 


TNT* a i rv 

PRO0 1 95 


106 


94 


3982 


AF025467 


Caenorhabditis 
elegans 


contains similarity to drosophila DNA- 
binding protein K10 (NID:g8148) 


1 1 1 


42 




"V AO /IOC 


Plasmodium 
vivax 


pval 


102 


39 


jyo4 


IjUzo4U 


Homo sapiens 


Human secreted protein, iSiiQ ID JNU: 


1 TT 

127 


HA 

64 


J70J 


A DAI Ol /CO 


xiorno sapiens 


jt\.rK-,\-* protein 


1 GA 


4Z 


3986 


AF0 16099 


Mus musculus 


endonuclease/reverse transcriptase 


139 


65 


jVo / 


T TOO ^'TA 


Homo sapiens 


putative pi 50 


1 -"7 A 


H 1 

71 




U3ol 1Z 


Homo sapiens 


NADH dehydrogenase subunit 5 


zUz 


7-> 




"Pl^ olio 
JLOO 1 1Z 


Homo sapiens 


NADH dehydrogenase subunit 5 


■"> A A 

z44 


O Z 
OJ 


3990 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


420 


92 


onn i 

jyy l 


D381 12 


Homo sapiens 


NADH dehydrogenase subunit 4 


272 \ 


85 


3992 


T OOAAQ 

L389l)o 


Nicotiana 

Ck V\ ?a /■» l inn 
(.<lUd.CU.IH 


extensin 


114 


40 


3993 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


116 


64 


3994 


AF090944 


Homo sapiens 


PRO0663 


99 


, 39 


3995 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


150 


75 


3996 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


574 


87 


3997 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


488 


76 • 
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% 
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3998 


U52077 


Homo sapiens 


mariner transposase 


475 


70 


3999 


U93570 


Homo sapiens 


p40 


111 


34 


4000 


U 15647 


Mus musculus 


reverse transcriptase 


137 


43 


4001 


AF1 16712 


Homo sapiens 


PR0273 8 


105 


52 


4002 


AF1 13685 


Homo sapiens 


PRO0974 


125 


53 


4003 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


92 


78 


4004 


AF1 16715 


Homo sapiens 


PR02829 


115 


73 


4005 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 2 


288 


80 


4006 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


614 


86 


4007 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


106 


45 


4008 


U90268 


Homo sapiens 


Kritl 


152 


52 


4009 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


472 


96 


4010 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


304 


100 


4011 


B03148 


Homo sapiens 


Human neuronal differentiation factor- 
1 (NDF-1). 


472 


98 


4012 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


102 


50 


4013 


G00262 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4343. 


124 


73 


4014 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


118 


47 


4015 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


522 


81 


4016 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


147 


52 


4017 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


106 


57 


4018 


AB003476 


Homo sapiens 


gravin 


549 


91 


4019 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


391 


93 


4020 


U44949 


Xenopus laevis 


zona pellucida A glycoprotein homolog 


143 


32 


4021 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


571 


88 


4022 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


554 


87 


4023 


G04078 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8159. 


165 


88 


4024 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


131 


38 


4025 


D38113 


Pan troglodytes 


cytochrome c oxidase subunit 1 


583 


86 


4026 


AF090895 


Homo sapiens 


PRO0117 


104 


49 


4027 


AF 13 0056 


Homo sapiens 


PRO 1367 


80 


60 


4028 


X97675 


Homo sapiens 


plakophilm 2b 


157 


83 


4029 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor I 68 
kDa subunit 


114 


35 


4030 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


110 


74 


4031 


D38112 


Homo sapiens 


ATPase subunit 6 


395 


80 


4032 


S70718 


Hemicentrotus 
pulcherrimus=se 
a urchins, tests, 
Peptide, 632 aa 


fibrillar collagen alpha 120 and 140 
chains 


104 


33 


4033 


Y12713 


Mus musculus 


Pro-Pol-dUTPase polyprotein 


355 


66 


4034 


AC004497 


Homo sapiens 


MX2 


464 


58 


4035 


G00333 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4414. 


151 


61 


4036 


AF1 17888 


Homo sapiens 


myosin-IXa 


113 


34 


4038 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ IDNO:163. 


100 


48 


4039 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


153 


73 
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% 
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4040 


D38112 


Homo sapiens 


NADH dehydrogenase subunit 4 


338 


86 


4041 


AF1 16661 


Homo sapiens 


PRO 143 8 


136 


53 


4042 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


120 


84 


4043 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 3 


493 


85 


4044 


M90516 


Homo sapiens 


glutamine:fmctose-6-phosphate 
amidotransferase 


261 


72 


4045 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


102 


64 


4046 


AF130089 


Homo sapiens 


PRO2550 


143 


78 


4047 


AF090942 


Homo sapiens 


PRO0657 


119 


41 


4048 


R95913 


Homo sapiens 


Neural thread protein. 


138 


47 


4049 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


111 


59 


4050 


X77962 


Tetrahymena 
thermophila 


fibrillarin 


105 


49 


4051 


G00338 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4419. 


144 


71 


4052 


X02794 


Friend murine 
leukemia virus 


Pr65 


107 


37 


4053 


Y00994 


Homo sapiens 


Human CSR3 protein sequence. 


109 


38 


4054 


AL121585 


Homo sapiens 


bA504H3. 1 (SNX5 (sorting nexin 5)) 


299 


74 


4055 


V00662 


Homo sapiens 


URF 1 (NADH dehydrogenase subunit) 


314 


80 


4056 


G03438 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7519. 


65 


54 


4057 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


187 


67 


4058 


AF205385 


Pan troglodytes 


NADH dehydrogenase subunit 5 


202 


89 


4059 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


187 


100 


4060 


U63542 


Homo sapiens 


FAP protein 


142 


73 


4061 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


158 


50 


4062 


Y95028 


Homo sapiens 


Human clone vp7 1 ORF2, SEQ ID 
NO:128. 


123 


44 


4063 


G00361 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4442. 


76 


65 


4064 


AF000298 


Caenorhabditis 
elegans 


weak similarity to collagens; glycine- 
and proline-rich 


165 


46 


4065 


G00427 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4508. 


147 


93 


4066 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


109 


76 


4067 


AB035523 


Gallus gallus 


avenalll 


101 


48 


4068 


AC079829 


Arabidopsis 
thaliana 


Pto kinase interactor, putative 


105 


43 


4069 


AF2 10651 


Homo sapiens 


NAG 18 


97 


77 


4070 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


111 


43 


4071 


G02538 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6619. 


158 


65 


4072 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


131 


43 


4073 


AF038007 


Homo sapiens 


FIC1 


153 


96 


4074 


M33228 


Trypanosoma 
brucei 


ATPase 6 


103 


37 


4075 


U54788 


Mus musculus 


Wiskott-Aldrich Syndrome Protein 


119 


46 


4076 


M81321 


Macaca 
fascicularis 


proline-rich protein 


104 


36 


4077 


X52235 


Homo sapiens 


ORFII 


192 


39 


4078 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


126 


83 
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4409. 






4079 


AB026542 


Homo sapiens 


WASP-family protein 


98 


41 


4080 


L17318 


Rattus 
norvegicus 


proline-rich proteoglycan 


105 


42 


4081 


AF130051 


Homo sapiens 


FRO0S9S 


117 


42 


4082 


AF130089 


Homo sapiens 


PRO2550 


85 


79 


4083 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


117 


48 


4084 


W50192 


Homo sapiens 


Amino acid sequence of salivary 
protein CON- 1. 


89 


40 


4085 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


163 


80 


4086 


G00403 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4484. 


144 


74 


4087 


AF085691 


Homo sapiens 


multidrug resistance-associated protein 
3A 


197 


49 


4088 


G02490 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6571. 


109 


62 


4089 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


139 


51 


4090 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


103 


65 


4091 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


159 


68 


4092 


X92485 


Plasmodium 
vtvax 


pval 


95 


38 


4093 


U93569 


Homo sapiens 


p40 


205 


40 


4094 


L 16864 


African swine 
fever virus 


cd2 homologue 


98 


45 


4095 


X71413 


Homo sapiens 


ELE1 


675 


98 


4096 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


149 


60 


4097 


Y 14482 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 17. 


116 


70 


4098 


AF090930 


Homo sapiens 


PRO0478 


149 


78 


4099 


AF090942 


Homo sapiens 


PRO0657 


124 


56 


4100 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


139 


62 


4101 


L26953 


Homo sapiens 


chromosomal protein 


104 


54 


4102 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


128 


70 


4103 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


94 


58 


4104 


U73199 


Mus musculus 


Rho-guanine nucleotide exchange 
factor 


370 


56 


4105 


AF090931 


Homo sapiens 


PRO0483 


104 


90 


4106 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


133 


72 


4107 


AF1 18082 


Homo sapiens 


PRO 1902 


145 


49 


4108 


AF265575 


Homo sapiens 


ubiquitous TPR-motif protein Y 
isoform 


116 


50 


4109 


U93564 


Homo sapiens 


p40 


539 


91 


4110 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


144 


54 


4111 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
' 7888. 


138 


65 


4112 


AF217536 


Homo sapiens 


truncated mevalonate kinase 


91 


73 


4113 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


120 


36 


4114 


Ml 7522 


Paracoccus 


cytochrome cl precursor (EC 1.10.2.2) 


101 


41 
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denitrificans 








4115 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


121 


59 


4116 


G02532 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6613. 


148 


73 


4117 


G01931 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6012. 


80 


85 


4118 


X04758 


Homo sapiens 


pro- alpha (V)collagen (A A 1099) 


106 


39 


4119 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


122 


47 


4120 


AF130089 


Homo sapiens 


PRO2550 


132 


68 


4121 


AF090942 


Homo sapiens 


PRO0657 


136 


48 


4122 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


115 


66 


4123 


W50193 


Homo sapiens 


Amino acid sequence of salivary 
protein CON-2. 


72 


48 


4124 


X92485 


Plasmodium 
vivax 


pval 


104 


54 


4125 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


131 


59 


4126 


AK024455 


Homo sapiens 


FLJ00047 protein 


86 


61 


4127 


AF1 19855 


Homo sapiens 


PRO 1847 


99 


68 


4128 


S80119 


Rattus sp. 


reverse transcriptase homolog 


129 


39 


4129 


G00344 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4425. 


156 


62 


4130 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


135 


72 


4131 


R86406 


Homo sapiens 


Human matrix metalloprotease 
MMPmla. 


108 


83 


4132 


L16461 


Chlamydomonas 
reinhardtii 


structural wall protein 


87 


37 


4133 


Y30713 


Homo sapiens 


Amino acid sequence of a human 
secreted protein. 


232 


95 


4134 


Y27854 


Homo sapiens 


Human secreted protein encoded by 
gene No. 101. 


121 


69 


4135 


G01984 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6065. 


135 


70 


4136 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


134 


42 


4137 


Y15173 


Human 

papillomavirus 
type 75 


E4 protein 


101 


38 


4138 


AF130089 


Homo sapiens 


PRO2550 


130 


35 


4139 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


111 


38 


4140 


Y36203 


Homo sapiens 


Human secreted protein #75. 


120 


48 


4141 


AF130089 


Homo sapiens 


PRO2550 


139 


36 


4142 


S80119 


Rattus sp. 


reverse transcriptase homolog 


170 


47 


4143 


U54636 


Staphylococcus 
aureus 


protein A 


135 


35 


4144 


AL390114 


Leishmania 
major 


extremely cysteine/valine rich protein 


157 


55 


4145 


G00354 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4435. 


112 


54 


4146 


G03203 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7284. 


117 


37 


4147 


R07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhcm) gene preoduct. 


98 


61 


4148 


AF130051 


Homo sapiens 


PRO0898 


124 


88 


4149 


G00673 


Homo sapiens 


Human secreted protein, SEQ ID NO: 


133 


67 
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/IOC/1 

4 / j4. 






A 1 Cf\ 

4150 


Z2V4&1 


Homo sapiens 


3-hydroxyanthranilic acid dioxygenase 


O A A 

244 


/:o 
62 


4151 


T\ /TO lOTI 

Mo lizl 


Macaca 
fascicularis 


pro line-rich protein 


172 


fA 

50 


A 1 CO 


vjUU41o 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


1 OA 

129 


oo 


41 jJ 




Homo sapiens 


poni ooo 
fKU 1 1 22 


IZo 


C)2 


4154 


AB037826 


Homo sapiens 


KIAA1405 protein 


194 


60 


4155 


A /T 1 O AAA 


Mus musculus 


pro line-rich protein 


124 


37 


4156 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


122 


46 


4157 


M11901 


Rattus 
norvegicus 


proline-rich salivary protein 


124 


41 


4158 


G00365 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4446. 


134 


83 


4159 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


141 


74 


4160 


M76546 


Helianthus 
annuus 


hydroxyproline-rich protein 


135 


39 


4161 


U35730 


Mus musculus 


jerky 


116 


31 


4162 


Z81525 


Caenorhabditis 
elegans 


contains similarity to Pfam domain: 
PF01391 (Collagen triple helix repeat 
(20 copies)), Score=35.5,E-value=4e~ 
07, N=2 


130 


50 


4163 


AF1 13685 


Homo sapiens 


PRO0974 


129 


45 


4164 


G04063 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

O 1 A A 

8144. 


168 


71 


4165 


U49973 


Homo sapiens 

| 1 


ORF1; MER37; putative transposase 
similar to pogo element 


137 


70 


4166 


O AO /CO O 

1303628 


Homo sapiens 


Human phospholipase 2 HPPL2. 


175 


72 


4167 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

lot I. 


86 


45 


4168 


AB000505 


Daucus carota 


unnamed protein product 


101 


43 


4169 


U87607 


Rattus 
norvegicus 


putative RNA binding protem 1 


113 


30 


4170 


AFl 19900 


Homo sapiens 


PR02822 


154 


59 


4171 


AE001381 


Plasmodium 
falciparum 


hypothetical protem 


106 


33 


4172 


AE000034 


Mycoplasma 
pneumoniae 


bifunctional threonine dehydrogenase; 
galactosyltransferase 


100 


28 


4173 


U12390 


Cloning vector 
pSportl 


beta-galactosidase alpha peptide 


100 


47 


4174 


G02480 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6561. 


142 


68 


4175 


A 17" AO A ACC 

AK024455 


Homo sapiens 


■pT TAAA/IO „ '■ j ' 

rU 00047 protem 


1 16 


60 


4176 


Ml 1759 


Lycopersicon 
esculentum 


cell wall hydroxyproline-rich 
glycoprotein 


94 


44 


4177 


A T/" AO A A C C 

AKU24455 


Homo sapiens 


FLJ00047 protein 


102 


59 


A 1 70 
41 /O 


A VAA7C CO 

AY0U755 / 


Mycobacterium 
avium subsp. 
paratuberculosis 


fibronectin-attachment protein FAP-P 


98 


A O 

42 


4179 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 18 
cione ioiN xir kjaj. 


115 


69 


4180 


M12100 


Mus musculus 


proline-rich protein MP-3 


132 


44 


4181 


AF090930 


Homo sapiens 


PRO0478 


138 


63 


4182 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


152 


75 


4183 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


184 


84 
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4184 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


150 


43 


4185 


G04067 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

O 1 A O 

8148. 


128 


51 


A 1 0<C 

4186 


A T7 A < \ r\ r\ /I A 


Homo sapiens 




124 


49 


a i on 
418 / 




Homo sapiens 


Human secreted protein rlCHProo, 
oiiv LU £y{J. 1 65. 


93 


62 


A 1 O Q 

41oo 


Loo 1 


Homo sapiens 


A l rase suDunit o 


/ion 


78 




a T?i qah^ i 
Ar loUUDl 


Homo sapiens 






5 / 


4190 


AF090930 


Homo sapiens 


PRO0478 


137 


83 


4191 


K0j205 


Homo sapiens 


salivary proline-rich protein precursor 


100 


35 


4192 


V00662 


Homo sapiens 


URF 4 (NADH dehydrogenase subunit) 


212 


91 


4193 


Y13247 


Homo sapiens 


FBI 9 protein 


142 


46 


4194 


V00662 


Homo sapiens 


URF 2 (NADH dehydrogenase subunit) 


268 


93 


4195 


AF 124729 


Mus muscuius 


acinusS' 


140 


42 


4196 


AJ277425 


Globodera 
pallida 


putative cuticular collagen 


156 


43 


4197 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:l63. 


161 


77 


4198 


Z70684 


Caenorhabditis 
elegans 


F28DK8 


97 


45 


4199 


M64793 


Rattus 
norvegicus 


salivary proline-rich protein 


119 


36 


4200 


X62379 


Mus muscuius 


form in, isoform IV 


115 


48 


4201 


D381 13 


Pan troglodytes 


NADH dehydrogenase subunit 4 


224 


93 


4202 


M14820 


Trypanosoma 
brucei 


ORF2 bases 1807-2850; first start 
codon at 2032; putative 


103 


29 


4203 


U93572 


Homo sapiens 


p40 


182 


34 


4204 


D82026 


Silene latifolia 


glycine-rich protein 


98 


49 


4205 


AF003151 


Caenorhabditis 
elegans 


contains similarity to an RNA 
recognition motif 


107 


40 


4206 


L07545 


Leishmania 
tarentolae 


A 'c T was inserted after nt 369 (=nt 
10459 in genomic sequence (M10126)) 
to correct -1 frameshift probably due to 
gel compression 


101 


35 


4207 


K03205 


Homo sapiens 


salivary proline-rich protein precursor 


104 


38 


4208 


AF229126 


Homo sapiens 


acetylcholinesterase collagen-like tail 
subunit isoform VII 


96 


75 


4209 


AF090895 


Homo sapiens 


PRO0117 


142 


68 


4210 


X92485 


Plasmodium 
vivax 


pval 


104 


35 


4211 


G02828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6909. 


173 


73 


4212 


G02828 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6909. 


151 


69 


. 4213 


AF 115549 


Homo sapiens 


117' 1 jj All * 1 /~! 7 . • 

Wiskott-Aldrich Syndrome protein 


122 - 


43 


4214 


AF090944 


Homo sapiens 


PRO0663 


97 


39 


4215 


X12544 


Homo sapiens 


3 HLA-DR B protein precursor (AA - 
29 to 267) 


111 


60 


4216 


M69065 


human 
herpesvirus 2 


ORF1 


89 


38 


4217 


A T7AAA OnC 

Ar 090895 


Homo sapiens 


rJKOOl 17 


133 


78 


4218 


B0 1372 


Homo sapiens 


Neuron-associated protein. 


135 


46 


4zly 


Y oo24o 


Homo sapiens 


Human secreted protein HCHPF68, j 
SEQ ID NO: 163. \ 


71 


47 


4220 


U93567 


Homo sapiens 


p40 


170 


35 


4221 


D00570 


Mus muscuius 


open reading frame (251 AA) 


108 


28 


4222 


X97675 


Homo sapiens 


plakophilin 2b 


122 


47 


4223 


AF270937 


Plutella 
xylostella 


PxORF73 peptide 


99 


54 
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granulo virus 








4224 


a T71 1 AAO A 

Ar 1^0089 


Homo sapiens 


TlTO ^\o rra 

PRO2550 


144 


69 


4225 


G03786 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7867. 


125 


82 


4226 


U42471 


Mus musculus 


Wiscott-Aldrich Syndrome protein 
homo log 


99 


62 


/lOOT 

4227 


DA'7AC7 

K07057 


Homo sapiens 


Smaller hepatocellular oncoprotein 
(hhem) gene preoduct 


172 


59 


/lOO o 

4228 


W4&351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


161 


74 


4229 


AT A/1ACOT 

AL049537 


Homo sapiens 


jti i / Jim i /i_ c u™ a ■ _ v * v. • i j 

dJ 1 164110.1 (breteldin A- inhibited 
guanine nucleotide-exchange protein 2) 


212 


90 


4230 


AF1 18086 


Homo sapiens 


PRO 1992 


150 


69 


4231 


M81321 


Macaca 
fascicularis 


proline-rich protein 


130 


41 


4232 


G02514 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6595. 


138 


68 


4233 


Y01400 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HNHF029. 


132 


56 


4234 


G03798 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7879. 


1 10 


52 


4235 


R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


124 


32 


4236 


G02485 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

OjOO. 


1 19 


42 


4237 


VH'! /t AO 


Homo sapiens 


21 -Glutamic Aciu-Kicn Protein 


j68 


56 


4238 


R95913 


Homo sapiens 


Neural thread protein. 


139 


55 


4239 


GUU32o 


Homo sapiens 


riuman secreted pro tern, orsQ ID NO: 
4409. 


146 


63 


/lO yi A 

4240 


X97675 


Homo sapiens 


plakophilin 2b 


108 


78 ' 


4241 


Y36156 


Homo sapiens 


Human secreted protein #28. 


137 


65 


4242 


A T/^AAA^OC 

AK000^85 


Homo sapiens 


unnamed protein product 


131 


33 


4243 


AJ252253 


human 
herpesvirus 2 


glycoprotein G-2 


107 


36 


/lO /I /I 

4244 


Y99447 


Homo sapiens 


Human PRO 1556 (UNQ764) ammo 
acia sequence i>tsO ID N(J:j /z. 


597 


100 


4245 


Y05398 


Homo sapiens 


Human TIE ligand NL8 protein 
sequence. 


424 


84 


4Z40 


T AAA 1 


Homo sapiens 


urf4 


000 

LLL 


AA 

90 


4^4 / 


Ar 1 J4D /y 


Zea mays 


arabinogalactan protein 


1*5 A 

ij4 


jy 


4Z4o 


Z;J44oD 


Zea mays 


extensin-lilce protein 


\Ld 


3D 


4249 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


128 


62 


/lO ^A 




Plasmodium 
yoelii 


circumsporozoite protein 


1 1 A 


~t "> 
3D 


/lO ^ 1 
HAD 1 


AJsJJz44j J 


Homo sapiens 


rj_,juuu4 / protem 


1 1 £L 


65 


423Z 




Homo sapiens 


C-U(^4z Oirase-activating protein 


566 


93 


4253 


D38112 


Homo sapiens 


cytochrome c oxidase subunit 1 


449 


85 


/IOC /I 

4254 


TO 1 *5 C C /T 

R13556 


Homo sapiens 


Protein encoded downstream of hhc_M 
oncoprotein. 


111 


63 


4255 


Mo 1 143 


Bovine 
herpesvirus 1 


latency-related open reading frame 2; 
putative 


1 A 1 

101 


42 


4256 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


113 


43 


4Z J / 




Homo sapiens 


Human transcription regulator MOP6 
partial sequence. 


A 1 A 

410 


79 


4258 


Y36495 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 27. 


112 


61 


4259 


D90252 


Human 

papillomavirus 
type 5b 


E4 protein 


110 


32 
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% 

Jdentit 
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4260 


X90569 


Homo sapiens 


elastic titin 


660 


95 


4261 


Ar 0909^0 


Homo sapiens 


PKO0478 


1 14 


71 


4262 


AF1^>2209 


Homo sapiens 


prepro-major basic protein homolog 


422 


72 


4263 


AF043102 


Pneumocystis 
carinii 


surface glycoprotein A 


121 


29 


4264 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 

7071 

loll. 


132 


55 


426 j 


Ar (J9uo9:> 


Homo sapiens 


dt> r~\f\ 1 n 


1 c a 

159 


62 


4200 


Ar U9U9JU 


Homo sapiens 


TDT> HA A TO 


143 


50 


4zo / 




Homo sapiens 


Human secreted protem, b.bQ ID JNU: 

*70 1 A 


1 1 A 

119 


34 


42oo 


f /U494 


Homo sapiens 


Sequence of human B-cell growth 
idLtor ^rsL^or ^. 


1 A/C 

106 


61 


4269 


D23660 


Homo sapiens 


ribosomal protein 


493 


73 


4Z /U 


Ar i JUUoy 


Homo sapiens 


rivUzD jU 


1 1 A 

119 


*> A 

39 


42/1 


U93->o4 


Homo sapiens 


putative pi 50 


121 


37 


42 /2 


ArJUiJol!) 


Mus musculus 


phospholipase C-L2 


A O C 

4dJ 


79 


4273 


AF006082 


Homo sapiens 


Arp2 


488 


85 


4274 


Y 12293 


Mus musculus 


lun 


104 


33 


4275 


X54289 


Bos taurus 


cGMP-dependent protein kinase 
(isoform I beta) 


561 


81 


4276 


AF1 19855 


Homo sapiens 


PRO 1847 


155 


71 


4277 


J01415 


Homo sapiens 


MTND4 


372 


63 


4278 


AB021078 


Plasmid Collb- 
P9 


ybbA 


101 


30 


4279 


000328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


1 10 


73 


4280 


/—i rv ao o o 

000328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


1 10 


73 


4281 


A T? 1 1 AOC1 

AF1 19851 


Homo sapiens 


TVT> /~V 1 TOO 

PRO 1722 


1 10 


60 


4282 


D3 8116 


Pan paniscus 


NADH dehydrogenase subunit 1 


346 


92 


4283 


Y15908 


Homo sapiens 


DIA-12C protein 


109 


52 


4284 


Y30681 


Homo sapiens 


Splice variant ZAP- IB protein of the 
human tumor suppressor gene ZAP-1 . 


122 


76 


4285 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


121 


42 


4286 


A T/AO A A C C 

AK024455 


Homo sapiens 


FJLJ00047 protein 


135 


57 


4287 


Y11525 


Homo sapiens 


CCAAT/enhancer binding protein 
alpha 


100 


32 


4288 


AF033122 


Homo sapiens 


non-p53 regulated PA26-T1 nuclear 
protein 


128 


81 


4289 


W48351 


Homo sapiens 


Human breast cancer related protein 
130KB2. 


111 


39 


/loon 


W4o3M 


, Homo sapiens 


Human breast cancer related protein 
BCRB2. 


140 


58 


429 1 


A TO/tOCylA 

A J 242 J 4U 


Vol vox carteri f. 
naganensis 


hydroxyproline-rich glycoprotein DZ- 

lilxAjrJr 


132 


34 


4292 


X60432 


Zea mays 


prolin rich protein 


118 


42 


4Zyo 




Homo sapiens 


zinc finger protein ZNF136 


4ol 


60 


4294 


AF038960 


Homo sapiens 


SKD1 homolog 


146 


82 


4295 


A T 1 A A 1 1 /I 

AL3901 14 


Leishmania 
major 


extremely cysteine/valine rich protein 


128 


54 




UOO DO! 


Homo sapiens 


kruppel-related zinc finger protein 


CIO 
D 12 


on 

o9 


4297 


AF043706 


Caenorhabditis 
elegans 


contains similarity to granulins 


104 


55 


4298 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


138 


65 


4299 


AF152510 


Homo sapiens 


protocadherin gamma A3 short form 
protein 


520 


87 
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SEQ ID 

NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


4300 


GO 1246 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
5327. 


142 


72 


4301 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
JO /u. 


109 


50 


4302 


A T A""> C A A f\ 


Arabidopsis 
thaliana 


putative protein 


103 


31 


a n f\i 
43 U 3 


JLUjOUU 


Homo sapiens 


— _ 

adenylyl cyclase 




96 


4304 


AF1 16712 


Homo sapiens 


PR02738 


140 


54 


4305 


JJ13757 


Homo sapiens 


amidophosphoribosyltransferase 


182 


92 


4306 


L20450 


Mus musculus 


DNA-binding protein 


470 


64 


4307 


AF213386 


Mus musculus 


A T^Vv 1*1* j . . • 

ATP-bindmg cassette protein 


175 


97 


4308 


X76850 


Mus musculus 


MAP kinase-activated protein kinase 2 


154 


69 


4309 


S801 19 


Rattus sp. 


reverse transcriptase homolog 


123 


55 


4310 


X67337 


Homo sapiens 


Human pre-mRNA cleavage factor I 68 
kDa subunit 


110 


32 


4311 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


96 


58 


4312 


Z46236 


Ovis aries 


keratinocyte growth factor 


210 


75 


4313 


AB020700 


Homo sapiens 


KIAA0893 protein 


569 


87 


4314 


AF124727 


Homo sapiens 


acinusS 


498 


88 


4315 


W88627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


88 


32 


4316 


AL 160493 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


122 


53 


4317 


AF 119900 


Homo sapiens 


PR02822 


135 


58 


4318 


U74612 


Homo sapiens 


hepatocyte nuclear factor-3/fork head 
homolog 1 1 A 


104 


70 


4319 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


117 


51 


4320 


AB026054 


Homo sapiens 


brain finger protein 


412 


90 


4321 


AF090944 


Homo sapiens 


PRO0663 


133 


71 


4322 


AF165513 


Homo sapiens 


vacuolar protein sorting 45 isoform 


712 


97 


4323 


Y99418 


Homo sapiens 


Human PR01317 (UNQ783) amino 
acid sequence SEQ ID NO:277. 


532 


97 


4324 


AF 163772 


Leishmania 
major 


7138.7 


142 


37 


4325 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


128 


59 


4326 


W40054 


Homo sapiens 


P3 00/CBP-associated transcriptional 
cofactor P/CAF C-terminus. 


175 


94 


4327 


AF2174H 


Homo sapiens 


neurohgin 3 isoform HNL3 


549 


84 


4328 


U94836 


Homo sapiens 


ERPROT 213-21 


175 


87 


4329 


Y20763 


Homo sapiens 


Human neurofilament-M mutant 
protein fragment 45. 


501 


87 


4330 


AB037745 


Homo sapiens 


KIAA1324 protein 


1014 


99 


4331 


M91563 


Rattus 
norvegicus 


NMDA receptor subtype 2C 


116 


39 


4332 


G03704 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7785. 


286 


76 


4333 


AC006841 


Arabidopsis 
thaliana 


Mutator-like transposase 


130 


70 


4334 


AP000373 


Arabidopsis 
thaliana 


jasmonate inducible protein; 
myrosinase binding protein-like 


130 


53 


4335 


U22961 


Homo sapiens 


similar to human albumin, Swiss-Prot 
Accession Number P02768; Method: 
conceptual translation supplied by 
author 


377 


90 


4336 


AJ277426 


Globodera 
pallida 


putative cuticular collagen 


112 


37 
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NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


4337 


U 16802 


Rattus 
norvegicus 


Ca2+-dependent activator protein; 
calcium-dependent actin-binding 
protein 


560 


79 


4338 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


118 


56 


4j>39 


A DAO A COO 


Homo sapiens 


notch4 


9j> 


33 


4j>4U 


a rno'i A/T 
Ar 12d4uo 


Homo sapiens 


nuclear prelamin A recognition factor 


241 


100 


4341 


W19771 


Homo sapiens 


Beta-1 integrin modulator B171 . 


169 


100 


4342 


AL096828 


Homo sapiens 


dJ963E22. 1 (Novel protein similar to 

>T\r T>TTKT O A „-*.*, \ 

N Y -KbN -2 Antigen) 


666 


88 


4343 


D503 12 


Homo sapiens 


uKA IP- 1 


542 


80 


4344 


T T O A A OT 

U84487 


Homo sapiens 


CX3C chemokme precursor 


371 


78 


4345 


U93569 


Homo sapiens 


putative pi 50 


111 


47 


4346 


AL161755 


Streptomyces 
coelicolor A3 (2) 


putative serine/threonine protein kinase 


102 


30 


4347 


AK026162 


Homo sapiens 


unnamed protein product 


377 


98 


4348 


AF221759 


Homo sapiens 


Maml 


314 


47 


4349 


AF 165926 


Homo sapiens 


NUP155 


147 


84 


4351 


U50185 


Rattus 
norvegicus 


PP-1M 


144 


52 


4352 


D50455 


Rattus 
norvegicus 


phodpholipase C delta4 


196 


70 


4353 


X54131 


Homo sapiens 


protein-tyrosine phosphatase 


261 


87 


4354 


AF151850 


Homo sapiens 


CGI-92 protein 


255 


92 


4355 


G03996 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8077. 


105 


95 


4356 


U93564 


Homo sapiens 


putative pi 50 


171 


72 


4357 


AP001507 


Bacillus 
halodurans 


unknown conserved protein 


159 


34 


4358 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


153 


60 


4359 


AF072697 


Mus musculus 


SHYC 


177 


97 


4360 


G03789 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7870. 


129 


69 


4361 


M98776 


Homo sapiens 


keratin 1 


449 


77 


4362 


AK024436 


Homo sapiens 


FLJ00026 protein 


671 


91 


4363 


W 8 8627 


Homo sapiens 


Secreted protein encoded by gene 94 
clone HPMBQ32. 


155 


49 


4364 


D13896 


Rattus 
norvegicus 


cytoplasmic dynein heavy chain 


327 


92 


4365 


X79389 


Homo sapiens 


glutathione transferase Tl 


164 


96 


4366 


AB006458 


Mus musculus 


alpha-D-mannosidase 


177 


56 


4367 


X01060 


Homo sapiens 


put. transferrin receptor (aa 1-760) 


160 


96 


4368 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


103 


54 


4369 


AL353715 


Homo sapiens 


bK3 1 84A7 .3 . 1 (helicase-like protein 
NHL) 


485 


100 


4370 


AF001631 


Oryctolagus 
cuniculus 


glucose-regulated protein GRP94 


118 


92 


4371 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


113 


80 


4372 


AL121741 


Schizosaccharom 
yces pombe 


putative negative regulator of vesicle 
formation 


200 


41 


4373 


M96982 


Homo sapiens 


U2 snRNP auxiliary factor small 
subunit 


279 


70 


4374 


U47741 


Homo sapiens 


CREB-binding protein 


123 


96 


4375 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW1 8. 


127 


75 


4376 


Y12781 


Homo sapiens 


trans ducin (beta) like 1 protein 


1056 


90 
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NO: 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


4377 


W69424 


Homo sapiens 


Human secreted protein bg!40_l . 


183 


97 


4378 


AL096700 


Homo sapiens 


dJ499B10.2 (phosphorylase kinase, 
alpha 2 (liver) (PYK)) 


639 


86 


4379 


AF177390 


Manduca sexta 


antennal specific membrane protein 
AMP 


378 


51 


4380 


X97675 


Homo sapiens 


plakophilin 2b 


156 


75 


43S1 


R33713 


Homo sapiens 


PgllOl. 


104 


100 


4382 


AB015473 


Arabidopsis 
thaliana 


gene_id:MCM23.1~uriknown protein 


113 


61 


4383 


AF1 16715 


Homo sapiens 


PR02829 


133 


50 


4384 


AL357472 


Homo sapiens 


VPS33B 


676 


99 


4385 


AF090931 


Homo sapiens 


PRO0483 


155 


58 


4386 


U92645 


Gecarcinus 
lateralis 


alpha- 1 -tubulin 


511 


75 


4387 


Y36095 


Homo sapiens 


Extended human secreted protein 
sequence, SEQ ID NO. 480. 


202 


90 


4388 


M64923 


Bos taurns 


C10 protein 


218 


95 


4389 


X98264 


Homo sapiens 


M-phase phosphoprotein 4 


197 


100 


4390 


AK000264 


Homo sapiens 


unnamed protein product 


239 


73 


4391 


M12937 


Homo sapiens 


ferritin heavy subunit 


662 


84 


4392 


P92219 


Homo sapiens 
(human) 


CR1 protein. 


116 


80 


4393 


X16135 


Homo sapiens 


L protein (AA 1-558) 


759 


99 


4394 


G00397 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4478. 


148 


58 


4395 


AC005591 


Homo sapiens 


PkB-like 


170 


97 


4396 


AF161426 


Homo sapiens 


HSPC308 


313 


77 


4397 


Y02671 


Homo sapiens 


Human secreted protein encoded by 
gene 22 clone HMSJW18. 


146 


61 


4398 


AF 2 04921 


Homo sapiens 


succinyl-CoA synthetase alpha subunit 


679 


88 


4399 


AF257330 


Homo sapiens 


COBW-like protein 


586 


90 


4400 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


116 


65 


4401 


AF1 19851 


Homo sapiens 


PRO 1722 


101 


79 


4402 


M55542 


Homo sapiens 


guanylate binding protein isoform I 


230 


76 


4403 


Y07752 


Volvox carteri 


pherophorin-S 


459 


88 


4404 


W48351 


Homo sapiens 


Human breast cancer related protein 
BCRB2. 


113 


44 


4405 


U07786 


Sus scrofa 


beta actin 


519 


90 


4406 


A06977 


Homo sapiens 


albumin 


586 


88 


4407 


L20755 


Cuscuta reflexa 


hybrid proline-rich protein; cytokinin- 
induced;haustoria 


112 


41 


4408 


AB002299 


Homo sapiens 


KIAA0301 


612 


98 


4409 


AB001424 


Mus museums 


KIF17 


104 


47 


4410 


M88108 


Homo sapiens 


p62 


574 


92 


4411 


AL121673 


Homo sapiens 


bA305P22.2 (novel protein) 


415 


71 


4412 


AF064553 


Mus musculus 


NSD1 protein 


224 


64 


4413 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


107 


36 


4414 


A06977 


Homo sapiens 


albumin 


505 


82 


4415 


A06977 


Homo sapiens 


albumin 


596 


90 


4416 


Y87116 


Homo sapiens 


Human secreted protein sequence SEQ 
ID NO: 155. 


123 


50 


4417 


AL359782 


Trypanosoma 
brucei 


possible (hhv-6) ul 102, variant a dna, 
complete virion genome. 


103 


85 


4418 


G03053 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7134. 


100 


54 


4419 


AF049606 


Mus musculus 


transcription factor NF-ATc isoform b 


114 


90 
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SEQ ID 
NO: 


Accession No. 


H 


Desrrinrinn 


Smith- 
Waterman 
Score 


% 

Identit 

y 


4420 


Z11190_cdl 


Homo sapiens 


1 1 -DEC- 1998 Interleukin-3 coding 
sequence from b2HFLS20W cDNA 
library. 


354 


97 


4421 


W64469 


Homo sapiens 


Human secreted protein from clone 
CW795_2. 


203 


100 


4422 


M12523 


Homo sapiens 


alloalbumin Venezia 


350 


94 


4423 


AF 130077 


Homo sapiens 


PR02619 


561 


89 


4424 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


I6l 


64 


4425 


Y91577 


Homo sapiens 


Human secreted protein sequence 
encoded by gene 2 SEQ ID NO:250. 


393 


91 


4426 


A09561 


synthetic 
construct 


human serum albumin 


651 


96 


4427 


AL133215 


Homo sapiens 


bA108L7.5 (novel protein similar to 
Plasmodium POM1 and C. elegans 
F46G11.1 (Tr:Q20485)) 


392 


100 


4428 


W33663 


Homo sapiens 


Human puromycin-sensitive 
aminopeptidase (PSA)-68. 


172 


96 


4429 


AB021654 


Homo sapiens 


DD2/biie acid-binding 
protem/AKRl C2/3 alpha- 
hydroxysteroid dehydrogenase type 3 


184 


81 


4430 


W63683 


Homo sapiens 


Human secreted protein 3. 


114 


42 


4431 


AY008763 


Homo sapiens 


sentrin/SUMO-specific protease 


447 


96 


4432 


U52965 


Homo sapiens 


ENX-1 


176 


94 


4433 


AF 180470 


Mus museums 


Kiaa0575 


423 


77 


4434 


XI 7206 


Homo sapiens 


put. LLRep3 protein (AA 1-221) 


581 


99 


4435 


Y70929 


Homo sapiens 


Human zilla4 splice variant protein. 


621 


100 


4436 


G00416 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4497. 


135 


71 


4437 


A06977 


Homo sapiens 


albumin 


610 


97 


4438 


A00279 


synthetic 
construct 


Human serum albumin 


621 


94 


4439 


L29028 


Chlamydomonas 
eugametos 


amino acid feature: N-glycosylation 
sites, aa 41 43, 46 .. 48, 51 .. 53, 72 
74, 107 .. 109, 128 130, 132 134, 
158 .. 160, 163 .. 165; amino acid 
feature: Rod protein domain, aa 169 .. 
340; amino acid feature: globular 
protein domain, aa 32 168 


105 


36 


4440 


AK024455 


Homo sapiens 


FLJ00047 protein 


112 


65 


4441 


U49973 


Homo sapiens 


ORF1; MER37; putative transposase 
similar to pogo element 


133 


69 


4442 


G02872 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6953. 


142 


60 


4443 


AF1 13685 


Homo sapiens 


PRO0974 


101 


54 


4444 


W03627 


Homo sapiens 


Human follicle stimulating hormone 
GPR N-terminal sequence. 


181 


42 


4445 


G03790 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7871. 


132 


76 


4446 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


141 


54 


4447 


L02867 


Homo sapiens 


paraneoplastic antigen 


136 


78 


4448 


D38435 


Homo sapiens 


homologue of yeast PMS l 


108 


66 


4449 


G00328 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4409. 


122 


65 


4450 


AF010144 


Homo sapiens 


neuronal thread protein AD7c-NTP 


142 


68 


4451 


AF220264 


Homo sapiens 


MOST-l 


101 


40 


4452 


AF130079 


Homo sapiens 


PR02852 


97 


59 


4453 


AF116715 


Homo sapiens 


PR02829 


139 


69 



246 



WO 01/64835 



PCT7US0 1/04927 



SEQ ID 
NO; 


Accession No. 


Species 


Description 


Smith- 
Waterman 
Score 


% 

Identit 

y 


4454 


G03807 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7888. 


144 


53 


4455 


AF130087 


Homo sapiens 


PR02411 


108 


73 


4456 


AF279891 


Homo sapiens 


dead box protein 15 


108 


58 


4457 


L08258 


Strongylocentrot 
us purpuratus 


kinesin light chain isoform 4 


176 


94 


4458 


G00412 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4493. 


116 


78 


4460 


G03172 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7253. 


146 


74 


4461 


G03801 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7882. 


156 


52 


4462 


G02607 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6688. 


118 


72 


4463 


AF 119900 


Homo sapiens 


PR02822 


144 


80 


4464 


M15530 


Homo sapiens 


B-cell growth factor 


92 


76 


4465 


Y17833 


Human 
endogenous 
retrovirus K 


env protein 


107 


62 


4466 


R96800 


Homo sapiens 


Human histiocyte-secreted factor HSF. 


132 


78 


4467 


AF090895 


Homo sapiens 


PRO0117 


112 


79 


4468 


G04072 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
8153. 


146 


64 


4469 


X97675 


Homo sapiens 


plakophilin 2b 


119 


78 


4470 


AF1 18081 


Homo sapiens 


PRO 1900 


119 


74 


4471 


G03564 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7645. 


121 


60 


4472 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


137 


63 


4473 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


205 


73 


4474 


AF130114 


Homo sapiens 


PR02459 


121 


67 


4475 


AF178534 


Homo sapiens 


talin 


213 


67 


4476 


Y27907 


Homo sapiens 


Human secreted protein encoded by 
gene No. 119. 


122 


92 


4477 


Y01158 


Homo sapiens 


Secreted protein encoded by gene 1 8 
clone HCACJ8L 


99 


66 


4478 


Y27868 


Homo sapiens 


Human secreted protein encoded by 
gene No. 107. 


142 


62 


4479 


G00689 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4770. 


153 


55 


4480 


AF2 17374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


130 


100 


4481 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


110 


57 


4482 


Y20983 


Homo sapiens 


Human glial fibrillary acidic protein 
GFAP wild type fragment 9. 


110 


63 


4483 


AF221552 


Oryza sativa 


proline-rich protein RiP-15 


119 


33 


4484 


L25941 


Homo sapiens 


integral nuclear envelope inner 
membrane protein 


110 


84 


4485 


AF130089 


Homo sapiens 


PRO2550 


161 


81 


4486 


U39742 


Caenorhabditis 
elegans 


coded for by C. elegans cDNA 
yk25e5.3; coded for by C. elegans 
cDNA yk25e5.5; similar to repeat 
guanylate kinase domain of D. 
melanogaster lethal(l) discs large- 1 
tumor suppressor protein 
(SP:DLGl_DROME, P31007) andR. 
norvegicus postsynaptic density protein 


98 


72 
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95 (PSD-95) (SP:PSD9_RAT, P31016) 






4487 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


89 


81 


4488 


X97675 


Homo sapiens 


plakophilin 2b 


127 


88 


4489 


G03714 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7795. 


105 


71 


4490 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


160 


64 


4491 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


122 


77 


4492 


U83280 


Leishmania 
donovani 


39 kDa antigen 


107 


76 


4493 


AF023142 


Homo sapiens 


pre-mRNA splicing SR. protein rA4 


141 


40 


4494 


K02576 


Homo sapiens 


salivary proline-rich protein 1 


119 


40 


4495 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


98 


45 


4496 


AF130089 


Homo sapiens 


PRO2550 


132 


75 


4497 


AF090930 


Homo sapiens 


PRO0478 


160 


76 


4498 


AF1 19900 


Homo sapiens 


PR02822 


158 


55 


4499 


AF1 16661 


Homo sapiens 


PR01438 


126 


44 


4500 


AF1 16661 


Homo sapiens 


PR01438 


118 


42 


4501 


AL 160371 


Leishmania 
major 


probable (hhv-6) ul 102, variant a 
DNA, complete virion genome 


113 


48 


4502 


M81321 


Macaca 
fascicularis 


proline-rich protein 


154 


44 


4503 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


154 


96 


4504 


D44596 


Saccharomyces 
cerevisiae 


Mdjlp heat shock protein 


93 


42 


4505 


Y86248 


Homo sapiens 


Human secreted protein RCHPF68, 
SEQ ID NO: 163. 


163 


76 


4506 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


207 


93 


4507 


U93564 


Homo sapiens 


p40 


520 


86 


4508 


Y02999 


Homo sapiens 


Fragment of human secreted protein 
encoded by gene 121. 


135 


55 


4509 


G03043 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7124. 


109 


77 


4510 


G00637 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
4718. 


146 


66 


4511 


L27428 


Homo sapiens 


reverse transcriptase 


143 


87 


4512 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO:163. 


105 


80 


4513 


G03787 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
7868. 


97 


75 


4514 


Y86248 


Homo sapiens 


Human secreted protein HCHPF68, 
SEQ ID NO: 163. 


128 


60 


4515 


AP000616 


Oryza sativa 


similar to RING-H2 finger protein 
RHAla (AF078683) 


120 


73 


4516 


AF130051 


Homo sapiens 


PRO0898 


141 


86 


4517 


U12690 


Homo sapiens 


cytochrome oxidase subunit II 


185 


94 


4518 


G02597 


Homo sapiens 


Human secreted protein, SEQ ID NO: 
6678. 


130 


100 


4519 


AK024455 


Homo sapiens 


FLJ00047 protein 


152 


68 


4520 


AF2 10651 


Homo sapiens 


NAG 18 


161 


88 


4521 


Y02693 


Homo sapiens 


Human secreted protein encoded by 
gene 44 clone HTDAD22. 


140 


68 


4522 


AF217374 


Acanthaster 
planci 


cytochrome oxidase subunit I 


131 


76 



248 



